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Abstract 

Random fc-SAT is the single most intensely studied example of a random constraint satisfaction problem. But 
despite substantial progress over the past decade, the threshold for the existence of satisfying assignments is not 
known precisely for any fc > 3. The best current results, based on the second moment method, yield upper and 
lower bounds that differ by an additive k ■ a term that is unbounded in k (Achlioptas, Peres: STOC 2003). The 
basic reason for this gap is the inherent asymmetry of the Boolean value 'true' and 'false' in contrast to the perfect 
symmetry, e.g., among the various colors in a graph coloring problem. Here we develop a new asymmetric second 
moment method that allows us to tackle this issue head on for the first time in the theory of random CSPs. This 
technique enables us to compute the fc-SAT threshold up to an additive In 2 — i + 0(l/fc) ~ 0.19. Independently of 
the rigorous work, physicists have developed a sophisticated but non-rigorous technique called the "cavity method" 
for the study of random CSPs (Mezard, Parisi, Zecchina: Science 2002). Our result matches the best bound that 
can be obtained from the so-called "replica symmetric" version of the cavity method, and indeed our proof directly 
harnesses parts of the physics calculations. 

1 Introduction 

Since the early 2000s physicists have developed a sophisticated but highly non-rigorous technique called the "cavity 
method" for the study of random constraint satisfaction problems. This method allowed them to put forward a very 
detailed conjectured picture according to which various phase transitions affect both computational and structural 
properties of random CSPs. In addition, the cavity method has inspired new message passing algorithms called Be- 
lief/Survey Propagation guided decimation. Over the past few years there has been significant progress in turning bits 
and pieces of the physics picture into rigorous theorems. Examples include results on the interpolation method 12], 'TI 
or the geometry of the solution space lfn i28ll29ll and their algorithmic implications |[3]|9|. 

In spite of this progress, substantial gaps remain. Perhaps most importantly, in most random CSPs the threshold for 
the existence of solutions is not known precisely. In the relatively simple case of the random fc-NAESAT ("Not- All- 
Equal-Satisfiability") problem the difference between the best current lower and upper bounds is as tiny as 2^^^^'^) IfTTI . 
By contrast, in random graph fc-coloring, a problem already studied by Erdos and Renyi in the 1960s, the best current 
bounds differ by 0(ln k) fS^. Hence, the difference is imboimded in terms of the number of colors. Even worse, in 
random fc-SAT the gap is as big as Q{k) 161. Yet random fc-SAT is probably the single most important example of a 
random CSP, not least due to the great amount of experimental and algorithmic work conducted on it (e.g., Il22ll24l ). 

The reason for the large gap in random fc-SAT is that the satisfiability problem lacks a certain symmetry property. 
This property is vital to the current rigorous proof methods, particularly the second moment method, on which most 
of the previous work is based (e.g., (|4]|5]|6l). More precisely, in random graph coloring the different colors all play 
the exact same role: for any proper coloring of a graph, another proper coloring can be obtained by simply permuting 
the color classes (e.g., color all red vertices blue and vice versa). Similarly, in fc-NAESAT, where the requirement is 
that in each clause at least one literal must be true and at least one false, the binary inverse of any NAE-solution is a 
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NAE-solution as well. By contrast, in fc-SAT there is an inherent asymmetry between the Boolean values 'true' and 
'false'. 

As has been noticed in prior work ||4] |6l, the second moment method is fundamentally ill-posed to deal with 
such asymmetries. Roughly speaking, the second moment method is based on the assumption that in a random CSP 
instance, two randomly chosen solutions are perfectly uncorrected. But in random fc-SAT, this is simply not the case. 
Indeed, suppose that a variable x appears much more often positively than negatively throughout the formula. Then 
it seems reasonable to expect that most satisfying assignments set x to 'true', thereby satisfying all clauses where x 
appears positively. More generally, define the majority vote a„iaj to be the assignment that sets variable x to true if it 
appears more often positively than negatively, and to false otherwise. Then we expect that the satisfying assignments 
of a random formula "gravitate toward" amaj- Unfortunately, the correlations among satisfying assignments induced 
by this drift toward amaj doom the second moment method. Previously this issue was sidestepped by symmetrizing 
the problem artificially (|4l|6l. But this inevitably leaves a 8(fc) gap. 

The main contribution of the present work is a new asymmetric second moment method that enables us to tackle 
this problem head on. A key feature of this method is that we harness the Belief Propagation calculation from physics, 
called the "replica symmetric case" of the cavity method in physics jargon. We are going to employ Belief Propagation 
directly as an "educated guess" in the design the random variable upon which our proof is based in order to quantify 
how much a typical satisfying assignment leans toward amaj ■ 

This is in contrast to most prior work on the subject, where individual statements hypothesized on the basis of 
physics arguments were proved via completely different methods (with the notable exception of the interpolation 
technique ||2l|71[l7l)- Hence, we view the present work as a pivotal step in the long-term effort of providing a rigorous 
foundation for the physicists' cavity method. In fact, the general approach developed here does not hinge on particular 
properties of the fc-SAT problem, and thus we expect that the technique will extend to other asymmetric problems as 
well. Examples include not only other random CSPs that are asymmetric per se, but also instances of random problems 
that arise at intermediate steps of message passing algorithms such as Belief/Survey Propagation guided decimation, 
even if the initial problem is symmetric. In particular, we believe that getting a handle on asymmetric problems is a 
necessary step to analyze such message passing algorithms accurately. 

To state our results precisely, we let fc > 3, n > be integers and we let V = {xi , . . . , x„} be a set of n Boolean 
variables. Further, let 4> = ^k{n, denote a Boolean formula with m clauses of length fc over the variables V 
chosen uniformly at random among all [2n)^'^ such formulas. Let r = m/n denote the density. We say that an event 
occurs with high probability ('w.h.p.') if its probability tends to 1 as n ^ oo. 

Friedgut ifTSll showed that for any fc > 3 there exists a threshold sequenc^ 7'fe-sAT(?^) such that for any (fixed) 
e > w.h.p. $ is satisfiable if m/n < (1 — e)rfc_sAT(?^), while for m/n > (1 + £)rfe_sAT('^) * is unsatisfiable 
w.h.p. 

Upper bounds on r^-sAT can be obtained via the first moment method. The best current ones lfT7ll23]| are 

rfe-sAT < ruppcr = 2^1n2- (l + ln2)/2 + Ofe(l), (1) 

where Ofc(l) hides a term that tends to for large fc. The best prior lower bound is due to Achlioptas and Peres ||6l, 
who used a "symmetric" second moment argument to show 

r-fe-SAT > r-bai = 2'^- ln2 - fc • - (^1 + + 0^(1). (2) 

The bounds ([l]) and ^ leave an additive gap of fc • ^^ + ^ + oa;(1), i-C-, the gap is unbounded in terms of fc. 
Theorem 1.1 There is Sk = Ofc(l) such that 

rfc-sAT > rep = 2^- In 2 - - eu- (3) 

Achlioptas and Peres asked whether the gap Tuppor — ''fc-SAT is bounded by an absolute constant (independent of 
fc). Theorem I 1 . 1 1 answers this question, reducing the gap to In 2 — \ w 0.19. No attempt at optimizing the error term 
£k has been made, but our proofs yield rather directly that Ek = 0(1 / fc). 



'it is widely conjecture but as yet unproved tliat r'fe— sat('^) converges for any fc > 3. 
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Apart from the quantitative improvement, the main point of this paper is that we manage to solve the problem 
of asymmetry in random CSPs for the first time. To explain this point, we start by discussing what we mean by 
asymmetry and how it derails the second moment method. That this is so was already intuited in ||4]|6l. In the next 
section, we are going to verify and elaborate on those discussions. 



2 Asymmetry and the second moment method 

The second moment method. In general, the second moment method works as follows. Suppose that Z = Z{^) is a 
non-negative random variable such that Z > Q only if $ is satisfiable. Moreover, suppose that for some density ?■ > 
there is a number C = C{k) > that may depend on k but not on n such that 

< E [Z^] < C • E [Zf . (4) 

We claim that then rfe_sAT > f. Indeed, the Paley-Zygmund inequality 

E \Z]^ 

P[^>0]>^ (5) 

implies that P is satisfiable] > P [Z > 0] > 1/C. Because the right hand side remains bounded away from as 
n — > oo, the following simple consequence of Friedgut's sharp threshold result implies r/;_sAT > 

Lemma 2.1 ( lITSl ) Let k > 3. If for some r we have 

liminf P is satisfiable] > 0, 

n— >oo 

then Tk-sAT >r-o{l). 

Hence, we "just" need to find a random variable that satisfies (|5j. Let S{^) denote the set of satisfying assign- 
ments; then certainly Z ~ \S{^) \ is the most obvious choice. However, this "vanilla" second moment argument turns 
out to fail spectacularly. We need to understand why. 

Asymmetry and the majority vote. The origin of the problem is that fc-SAT is asymmetric in the following sense. 
Suppose that all we know about the random formula $ is for each variable x the number of times that x appears 
as a positive literal in the formula, and the number d-.^ of negative occurrences. Then our best stab at constructing a 
satisfying assignment seems to be the "majority vote" assigment a„iaj where we set x to true if dx > d-,x and to false 
otherwise. Indeed, by maximizing the total number of true literal occurrences, of which a satisfying assignment must 
put one in every clause, (Jmaj also maximizes the probability of being satisfiable. 

Our proof of Theorem 11.11 allows us to formalize this observation, thereby verifying a conjecture from |f6l|. Let 
dist(-, •) denote the Hamming distance. 



Corollary 2.2 There is a number 6 = 6{k) > such that for 2^ /k < r < tbp w.h.p. we have 

tjcr, <T; 



Hence, the average Hamming distance of a E from (J„iaj is strictly smaller than n/2, i.e., the set S{^) is 

"skewed toward" amaj w.h.p. 

This asymmetry dooms the second moment method. To see why, let 

fif.\ ^ max {dx,d^x} 
km 

xev 

denote the majority weight of ^. Then the larger Wmaj, the more likely amaj and assignments close to it are to be 
satisfying. In effect, the bigger w,naj, the more satisfying assignments we expect to have. The consequence of this 
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is that the number |5(#)| of satisfying assignments behaves like a "lottery": its expectation is driven up by a tiny 
fraction of "lucky" formulas with w,naj much bigger than expected. 

Let us highlight this tradeoff, as it is characteristic of the kind of trouble that asymmetry causes. For ^ > 
independent of but sufficiently small it turns out that for a certain constant c > 0, 

P [w^^i ^ E [i«„,aj] + e] - cxp [-{c^ + Oie))n\ . (7) 

That is, the probability is exponentially small but, like in the Chernoff bound, the exponent is a quadratic function 
of By comparison, increasing the majority weight by ^ boosts the expected number of satisfying assignments by a 
linear exponential factor: there is c' > such that 

E[|5(*)| |wn,aj-E[u.n,aj]+e] - cxp [(c'^ + 0(e')H • E | | w;„,aj E [w;„,aj]] ■ (8) 

The exponent in ([8]l is linear because for a typical assignment r at distance (i — 6)n from amaj increasing Wmaj by £, 
boosts the number of literals that are true under r by 2d£, ■ km, a term that is linear in ^. 

Since the exponent is linear in (O but quadratic in (|7|, there is a (small but) strictly positive ^ > such that the 
"gain"exp [(c'^ + 0(^^))n] in the expected number of satisfying assignments exceeds the "penalty" exp [— (c^^ + 0{£,^))n\ 
for deviating from E [wmaj]- With little extra work, this observation leads to 

Lemma 2.3 For any fc > 3 and r > 2^^ /k we have 

< exp {-n{A-'') • 7i) • E w.h.p. 

Lemma |23] entails rather easily that the "vanilla" second moment argument fails dramatically. Indeed, as already 
noticed in |l4l|6l, we have E > exp(ri(7i)) • E Hence, we miss our mark (01) by an exponential 

factor But Lemma l23] is witness to an even worse failure: not only does (jUi fail to hold, but even the normally much 
more dependable ^raf moment overshoots the "actual" number of satisfying assignments by an exponential factor! 
(Lemma [2.3l is an improvement of an observation from HI, showing that < cxp(— ^n)E [|iS(^')|] w.h.p. for 

some tiny ^ = ^(fc) > 0; we conjecture that the 4^*^ term in Lemma |23] is tight.) 

In summary, the drift toward a^naj and the resulting fluctuations of the majority weight induce a tremendous source 
of variance, derailing the "vanilla" second moment argument. 

Balanced assignments. A natural way to sidestep this issue is to work with a 'symmetric' subset of 5(4>). Perhaps 
the most obvious choice is the set 5nae(*) of NAE-solutions. In a landmark paper, Achlioptas and Moore Q proved 
that indeed there is C = C{k) > such that for Znae = |'5nae(^)| we have 

E [Z^ae] < • E [Znae]' forr < 2^-^ In 2 - 0^(1). (9) 

As we saw above (cf. Lemma IZTI ). this implies that 7'a;-sat > 2'°"^ In 2 — 0(1). However, a simple (first moment) 
calculation shows that for r > 2''^^ ln2, the set 5nae(*) is empty w.h.p. Thus, the idea of working with NAE- 
solutions stops working at r ~ 2*^"^ In 2, about a factor of two below the satisfiabiUty threshold. 

Achlioptas and Peres fE\ obtained a better bound by precipitating symmetry in a more subtle manner Let us call 
<T e {0, 1}" balanced if under a out of the km literal occurrences in # exactly half are true (i.e., ^ ± 1). Thus, 
balanced assignments are expressly forbidden from pandering to cr„iaj- Now, let 5bai(^) be the set of all balanced 
satisfying assignments, and set Zbai = |5bai(*)|- Achlioptas and Peres used a clever weighting scheme to prove that 

E [Z^ < C • E [Zbai]' for r < rtai (cf. ©). (10) 

As before, this implies that rfe_sAT > ^bai (Lemma [2.1l i. 

Yet as in the case of NAE-solutions, balanced satisfying assignments cease to exist way before the satisfiability 
threshold. Indeed, Achlioptas and Peres observed that iSbai(*) = for r > 2*^' In 2 — fci^ w.h.p. In effect, to close in 
further on rfc_sAT we will have to accommodate assignments that lean toward amaj- How can this be accomplished? 
A quick fix? We saw that to make an asymmetric second moment argument work, we need to rule out fluctuations of 
the majority weight. A sensible way of implementing this is by actually fixing the entire vector d = {dx, d^x)x£V 
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that counts the positively/negatively occurrences of each variable. More precisely, given a non-negative integer vector 
d = {dx, d^x)x£V with J2xev ~^ '^^^ ~ denote a uniformly random fc-CNF in which each variable a; 

appears dx times positively and d^x times negatively. Then we can spht the generation of a random formula $ into 
two steps: 

First, choose the occurrence vector d randomly from the "correct" distribution D. 

Then, choose a random formula 

The "correct" D is as follows. Let e = [cx, e-,x)xi^v be a family of independent Poisson variables with mean 
kr/2 each. Moreover, let 8 be the event that X^^ey + e~,x = km. Let D be the conditional distribution of e given 
8. Then standard arguments show that the outcome of first choosing d and then €>d is exactly the uniformly random 
*. 

The point of generating 4> in two steps as above is that given the outcome d of the first step, the majority weight is 
fixed. Hence, if we could show that given a "typical" d, the second moment succeeds for we would obtain a 

lower bound on ru-sAT- Unfortunately, matters are not so simple. 

Lemma 2.4 W.h.p.fora vector d chosen from D we have E[|iS($d)|^] > exp • E 

Let us stress the two levels of randomness in Lemma [Z4l First, there is the choice of d. Then, for a given d, we 
compare E[|5($d)|^] and E Of course, both of these quantities depend on d, and we find that w.h.p. d is 

such that the first exceeds the second by an exponential factor. 

The explanation for this is that even if we fix d, various other types of fluctuations remain, turning into 
a "lottery". For instance, even given d the number of clauses that are unsatisfied under (Tmaj fluctuates. Hence, the 
inherent asymmetry of fc-S AT puts not only the majority weight but also various other parameters on a slippery slope. 
What we need is a way of controlling all these fluctuations simultaneously. We will present our solution in Section|5] 

Catching the k-SAT threshold? Before we come to that, let us discuss what it would take to eliminate the (small but 
non-zero) gap left by Theorem ll.il i.e., how far we are from "catching" the fc-SAT threshold. The physicists' cavity 
method comes in two installments. The (relatively speaking) simpler "replica symmetric" version is based on Belief 
Propagation. Theorem 11.11 provides a rigorous proof of the best possible bound on the fc-SAT threshold that can be 
obtained from this version of the cavity method (up to possibly the precise error term e^) 1251 . 

Unfortunately, for r > tbp the replica symmetric version (and in particular the Belief Propagation predictions 
that we depend upon) are conjectured to break down. According to the more sophisticated "1-step replica symmetry 
breaking" (IRSB) version of the cavity method, the reason for this is that at r ^ rep a new type of correlation amongst 
satisfying assignments arises. To deal with these correlations, the physics methods replace Belief Propagation by the 
much more intricate Survey Propagation technique. 

In [;1 11 we managed to prove rigorously that the IRSB prediction for the random fc-NAESAT threshold is correct 
(up to an additive 2^^^'^^^). However, ifTTl depends heavily on the fact that fc-NAESAT is symmetric. While it would 
be very interesting to combine the merits of the present paper with those of ifTTI . this appears to be quite challenging. 
Thus, putting the IRSB calculation for random fc-SAT on a rigorous foundation remains an important open problem. 
That said, we believe that any such attempt would need to build upon the techniques developed in this paper. 

3 Related work 

The interest in random fc-SAT originated largely from the experimental observation that there seems to be a sharp 
threshold for satisfiability and, moreover, that for certain densities r < r^^sAT no polynomial time algorithm is 
known to find a satisfying assignment w.h.p. Il22ll24l . Currently, the precise fc-SAT threshold is known (rigorously) 
only in two cases. Chvatal and Reed fE\ and Goerdt 1211 proved independently that r2_sAT = 1- Of course, 2-SAT 
is special because there is a simple criterion for (un)satisfiability, which enables the proofs of |I8]|2T]. Unsurprisingly, 
these methods do not extend to fc > 2. Additionally, the threshold is known precisely when fc > logj n, i.e., the clause 
length diverges as a function of n ll20l . In this case, the problem of asymmetry evaporates because the majority weight 
is sufficiently concentrated for the "vanilla" second moment method to succeed. (Note that Proposition 12 . 3 1 holds for 
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any fixed k, but not for k = k{n) — > oo.) The issue of asymmetry also disappears in the case of strongly regular 
formulas lISTl where for some fixed d we have dx = d-,x = d for all x G V. 

Also in random fc-XORSAT (random linear equations mod 2) the threshold for the existence of solutions is known 
precisely lfT4l . The proof relies on computing the second moment of the number of solutions (after the instance has 
been stripped down to a suitable core). In contrast to random fc-SAT, the random fc-XORSAT problem is symmetric 
(cf. Remark l53] below). albeit in a more subtle way than fc-NAESAT. 

Other problems where the second moment method succeeds are symmetric as well. Pioneering the use of the 
second moment method in random CSPs, Achlioptas and Moore Q computed the random fc-NAESAT threshold 
within an additive 1/2. By enhancing this argument with insights from physics this gap can be narrowed to a mere 
2-n{k) |[YTl[T2l . Moreover, the best current bounds on the random (hyper)graph fc-colorability thresholds are based 
on "vanilla" second moment arguments as well 15] [15]. In summary, in all the previous second moment arguments, 
the issue of asymmetry either did not appear at all by the nature of the problem ||4]|5][TTl[l2l[l4l[l5]|20l, or it was 
sidestepped @. 

The best current algorithms for random fc-SAT find satisfying assignments w.h.p. for densities up to 1.817 • 2'^/fc 
(better for small fc) resp. 2'^ In(fc) / fc (better for large fc) ||9][l9|, a factor of 6(fc/ In fc) below the satisfiability threshold. 
By comparison, the Lovasz Local Lemma and its algorithmic version succeed up to r = 8(2''/fc^) lf30l . 

Apart from experimental work ||24l , very little is known about the physics-inspired message passing algorithms 
("Belief/Survey Propagation guided decimation") |27]. The most basic variant of Belief Propagation guided decima- 
tion is known to fail w.h.p. on random formulas if r > c-2''/k for some constant c > IfTOl . However, it is conceivable 
that Survey Propagation and/or other variants of Belief Propagation perform better. 

4 Preliminaries 

We shall make repeated use of the following local limit theorem for the sums of independent random variables, see lfT6l 
and nn . 

Lemma 4.1 Let Xi, . . . , Xn be independent random variables with support on Nq with probability generating func- 
tion P{z). Let /i = E[Xi] and ~ Xai[Xi\. Assume that P{z) is an entire and aperiodic function. Then, uniformly 
for all Tq < a < Too, where = lim^-j.^; as n ^ oo 

PvlXi + ■■■+ X,, ^ an] = (1 + o(l)) — ( , (11) 



where ^ and ^ are the solutions to the equations 



p^^-^ = a and ^ = (in P{z) - a in z) 



2 = C 



(12) 



Moreover, there is a Sq > such that for all < \6\ < Sq the following holds. If a = E[^i] + Sa, then 



Pr[Xi + . . . + X„ = an] = (1 + 0{S)) ^ ^ ^i-s'' /2+o{S^))n _ ^^3^ 

v27rnfT 

From this we can rather easily derive the following well-known statement about the rate function of the binomial 
distribution. 



Lemma 4.2 Let < p,q < 1. Let 



■4^{p,q) = -^In ( - ) - (1 - q)ln ^ ^ 



If p, q remain fixed as n — > 00, then 

P [Bin(7i,p) = qn] = Q{n^^^'^) exp [4'{p^ q)n] ■ 
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The following form of the chain rule will prove useful. 

Lemma 4.3 Let g : R° — > R** and f : R'' — > R fee of class C^, i.e, with continuous second derivatives. Then for any 
Xq G R° and with i/q = g{xQ) we have for any i,j(z [a] 



d^fog 



dxidxj 



Xf, 



E 

k=l 



5/ 
dyk 



d'gk 



yo dx^dxj 



E 



^0 k.l = l 





dgk 


dgi 




dykdyi 


, dxi 




2:0 



Finally, we need the following version of the inverse function theorem that states under which conditions a given 
system of equations can be solved around a specific point u. A detailed exposition can be found in 1321 . 

Lemma 4.4 Let U C R'* be open and let f (U). Assume that u E U and A > are such that 

{x G R'' : -it||2 < X} C U. 

Let Df{x) be the Jacobian matrix of f at x, id the identity matrix, and \\-\\ denote the operator norm over L^(R''). 
Assume that Df{u) = id and 



\\Df{x) — id|| < — for all x G R'' such that \\x ~ uW^ < A, 

o 

Then for each y G R'' such that \\y — /(u)|| < A/2 there is precisely one x G R'' such that \\x — u\\ < r and 
f{x) ~ y. Furthermore, the inverse map is on {x G R'' : \\x — wjlj < A}, and Df~^{x) ~ {D f {x))~^ on 
this set. 



Notation. We will generally assume that n > no, fc > ko for certain large enough constants no,ko. We are going 
to use the asymptotic symbols 0{f{x)), fl{f{x)), etc. It is understood that the asymptotic is with respect to the 
parameter x of the function f{x). Thus, if / is a function of n, then the asymptotic notation refers to the limit n — > 00, 
and if / is a function of k, then the notation refers to k being large. We use the following convention for the 0-notation 
in the case that / is a constant: we let 0(1) be a term that remains bounded in the limit of large n, but that may by 
unbounded in terms of k. By constrast, Ofc(l) refers to a term that remains bounded both in the limit of large k and 
large n. Expressions such as 0^(1) are to be interpreted analogously. Generally, all asymptotics are uniform in the 
various other parameters (such as the degree sequence d or r). For a function f{k) > use the symbol 0{f{k)) to 
denote a function g{k) such that for some constant c > we have g{k) = 0{f{k) ■ \if J{k)). For vectors ^, ry we use 
the symbol 

v = i 

to denote the fact that ||^ - -qW^ < 0{l/n). 

Let V ~ {xi, . . . ,Xn} and let L — {xi, ^xi, . . . , a;„, ^x„}. For a literal / G i we let |Z| denote the underlying 
variable. Moreover, sign(Z) = 1 if Hs a positive literal, and sign(Z) = — 1 otherwise. For a fc-CNF $ we let $i denote 
the ith clause of $ and $ij the jth literal of $i. 

From here on out, we let 

3 

r = 2~'=ln2-p with p=-ln2-efc (14) 
for some sequence Sk — Ofc(l) that tends to sufficiently slowly. 



5 The random variable 
5.1 The construction 

Our goal is to make the second moment method work for a random variable that counts "asymmetric" satisfying 
assignments. In this section, we develop this random variable. The starting point, and the key ingredient, is simply a 
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map p : Z ^ [0, 1]. For the sake of clarity, we start by setting up the framework for generic maps p; below we will 
use the Belief Propagation formalism to pick the "optimal" p. 

The idea is that p prescribes how strongly the assignments that we work with lean toward the majority vote. 
Informally speaking, we are going to work with assignments such that a variable x that occurs times positively and 
d^x times negatively has a p{dx — d^x) chance of being set to 'true'. Before we give a formal definition, we need to 
fix the number of times that each variable appears positively or negatively. 

Fixing the majority weight. As we saw in Section|2] in order to make the second moment argument work, we need to 
rule out fluctuations of the majority weight. To achieve this, we follow the strategy outlined in Section |2] That is, we 
are going to work with formulas $d with a given vector d = (dx, d^x)x£V of occurrence counts, where each variable 
X appears precisely dx times positively and d^x times negatively. As in Section |2] we let D denote the (conditional 
Poisson) distribution over sequences d such that first choosing d from D and then generating is equivalent to 
choosing a fc-CNF $ uniformly at random. 

Fixing the marginals. Now, fix one such vector d. Then the map p : Z — > [0, 1] induces a map pd from the set 
L = {x, -^x : X G V} of literals to [0, 1] in the natural way: we let 



The idea is that, given d, we should set variable x to 'true' with probability (2:) ■ 

To formalize this, we call Pd{l) the Pd-type of the literal I. Let T = Td = {Pd{l) I E L}he the set of all possible 
Pd-typss- We say that a : V ^ {0,1} has pd-marginals if for any type t £ Td^^s have 



i.e., among all occurrences of literals of type t,nt fraction is true under a. This definition captures the above idea that 
variable x has a Pd{x) chance of being 'true'. 



Fixing the clause types. We define the pd-type of a clause li V ■ ■ ■ V h as the fc-tuple {pd (h) , . . . ,pd {Ik)) <= [0, 1] 
comprising of the individual literal types. Let C = £d = Td be the set of all possible clause types. For each £ E Cd 
let Mq,^{£) be the set of indices i e [m] such that the ith clause of #d has type £, and let ■m^^{£) = \Mq,^{£)\. 

In addition to fluctuations of the majority weight, we also need to suppress fluctuations of the numbers m*^(^). 
We are going to use the same trick as in the case of the majority weight. Namely, we split the generation of a random 
formula #d into two steps: 

First, choose a vector m = {m{£))i^c from the "correct" distribution Md- 

Then, generate a formula ^d.m uniformly at random in which each variable x appears exactly dx times posi- 
tively and exactly d-,x times negatively and that has exactly m{() clauses of type t for all £ E C. 

Formally, the "correct" Md is just the distribution of the random vector mq,^ = {m^^{£))i^c that counts the clauses 
by types in the "unrestricted" formula It is easily verified that the overall outcome of the above experiment is 
identical to From now on, we fix both d and m. 

Given d, m there is a simple way of generating the random formula ^d.m- Namely, create di clones of each literal 
I, and put all the clones of a given p^-type on a pile. Then the formula ^d.m is simply the result of matching the 
clones on the type t pile randomly to all the clauses where a literal of type t is required. 

An assignment a with p^-marginals splits each pile into two subsets, namely the clones that are true under a and 
those that are false. For each type t, among the clones in the type t pile, a ^-fraction are true, since a has pd-marginals. 
Therefore, we expect that under the random matching, for each clause type £ = {£1, . . . ,£k) and each index j, in an 
^j-fraction of clauses the jth literal is matched to a 'true' clone. 

Judicious assignment. This observation motivates the following definition. We say that an assignment <t is pd- 
judicious in ^d.m if for all clause types £ — {£i, . . . , £k) G C and all j E [k] we have 



Pd{x) ^p{dx - d-,x) andpdi^x) = 1 -p{x). 



(15) 



^ {a{l)-t)-di=0{l). 





(16) 



8 



where ^d,m.i,j denotes the jth literal of the ith clause of ^d.m, and the sum is over all i such that the ith clause has 
type £. Let Sp{^d.m) be the set of p-judicious satisfying assignments, and set Zp{^d,m) — \Sp{^d,m)\- 

Given that a is p-judicious, in order for a to be satisfying we just need that for each type £ the 'true' clones are 
distributed so that each clause receives at least one. Thus, the event of being satisfying is merely a matter of how 
exactly the 'true' clones are "shuffled" amongst the clauses of type £, while for each j the total number of 'true' clones 
of type £j is fixed. In particular, this shuffling occurs independently for each clause type. Such random shuffling 
problems tend to be amenable to the second moment method. Therefore, it seems reasonable to expect that a second 
moment argument succeeds for Zp{^d.m)- This is indeed the case for r < rep — 1 + ln2 « tbp — 0.3. However, to 
actually reach tbf we need to control one further parameter 

Fixing the cluster size. According to the physics predictions ||25l [271 . for rbai < r < rep the set of satisfying 
assignments decomposes into an exponential number of well-separated 'clusters'. More precisely, we expect that 
w.h.p. for any two satisfying a, r either dist((T, r) < O.Ol??, (if a, t belong to the same cluster), or dist((T, r) > 0.49n 
(different clusters). Formally, we simply define the cluster of a as 



dist((T, t) 



2 '2 



The intuitive reason why the second moment argument for Zp{^d.m) breaks down for r close to rep is that the cluster 
sizes |CCT(4'd,m)| fluctuate. A similar problem occurred in prior work on random fc-NAESAT lfm[T2l . 

As in those papers, the problem admits a remarkably simple solution: let us call an assignment a good in ^d,m if 



\Ca{^d,m)\ <E[Zp{^d, 



(17) 



Let 5p,good(*d,m) be the set of afl good a e Sp{^d,m)- To avoid fluctuations of the cluster size, we are just going 
to work with Zp,good = \Sp,goodi^d,m)\- 

The second moment bound. We now face the task of estimating the first and the second moment of ^p,good(^d,m)- 
The result can be summarized as follows. 

Theorem 5.1 Suppose rbai < r < rep. There exists C ~ C{k) and a map p = pbp : Z — > [0, 1] such that for d 
chosen from D and for m chosen from Md w.h.p. 

< E i^d.mf] <C-E [Zp,good(*d,m)]' . 

Together with Paley-Zygmund (|5]l, Theorem lS. ll shows that with d chosen from D and m chosen from Md w.h.p. 



P [*d.m is satisfiable] > P [^p,good(*d,»n) > 0] > 



E [^p,good(*d,m)]^ 
E[Zp.good(*d,m)2] '- C" 



> 



(18) 



The construction of D, Md ensures that choosing ^> at random is the same as first picking d from D and m from 
Md and then generating ^d.m- Therefore, ( fTSl l implies liminf„_j.oo P is satisfiable] > 0, so that Lemma [TTI 
yields r^sAT > ''bp- Hence, we are left to prove Theorem l5.ll We begin by constructing the map pbp- 

Guessing the marginals. For a set ^ S" C {0, 1}^ and a variable x we define the S-marginal of x as 



E 



a{x)_ 

\s\ ■ 



(19) 



The definition of 'pd-judicious' is guided by the idea that Pd{x) should prescribe the marginal of x in the set of all 
Pd-judicious satisfying assignments. Hence, in order to make the set of p^-judicious assignments as good an approxi- 
mation of the entire set of satisfying assignments as possible, we better pick p so that Pd{x) is a good approximation 
to the actual marginal /i5(#^)(a:;) of x in the set of all satisfying assignments. The problem is that, because of the 
asymmetry of the fc-SAT problem, these marginals are highly non-trivial quantities. Indeed, on general formulas <I> the 
marginals (x) are #P-hard to compute. 

However, according to the physicists' cavity method, on random formulas with density r < rep the marginals can 
be computed by means of an efficient message passing algorithm called Belief Propagation ESl . While the mechanics 
of this are not important in our context, the result is. 
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Conjecture 5.2 Suppose that rbai < t < rep. Let d be chosen from D and let x be a variable. Then w.h.p. 



1 dr — d-,rr „ f dx — d-,x \ 
M5(*.)(x)=2+^2l+r^ + 0(^^^j . (20) 

We observe that ( l20l i is in line with the notion that iS($d) is "skewed toward" (J,naj- Indeed, the conjecture quantifies 
how much so. Motivated by Conjecture 15. 21 we define 

^ if Izl < 10Vk2''lnk, 




PBpiz)=<^ t (21) 



Under the distribution D, the random variables d^, d^^ are asymptotically independent Poisson with mean kr/2 (cf. 
Section|2]i. Therefore, 

Ed[{dx-d^xf] =kr<k2Hn2, 

and standard concentration inequalities show that w.h.p. there are no more than n/k^^ variables x with {d^ — d-,xY > 
100fc2'^ In k. Hence, pd = PBP,d is (asymptotically) equal to the conjectured value on the bulk of variables w.h.p. 

In summary, the problem with the "vanilla" second moment argument is that the drift toward dmaj induces correla- 
tions amongst the satisfying assignments. Indeed, they are correlated with the majority assignment and thus with each 
other. We circumvent this problem by explicitly prescribing the marginal probability that each variable is set to 'true' . 
One could think of this as working with the intersection of S{^) with a particular "surface" within the Hamming cube 
{0, 1}", namely the assignments with p^-marginals. Within this surface, all assignments are slanted equally toward 
(Tjnaj ■ The Belief Propagation-informed definition of pep is meant to ensure that the surface that we consider with is 
(about) the most populous one, i.e., the one with the largest number of satisfying assignments in it. The core of our 
argument will be to show that with respect to the marginal distribution pbp, i-C-, within the surface that pep defines, 
two random elements of Sp{^d,m,) are typically uncorrected. But before we come to that, we need to compute the 
"first moment", i.e., the expected number of good psp-judicious satisfying assignments. 

Remark 5.3 Belief Propagation actually leads to a stronger prediction than Conjecture 15.21 Namely, it yields a 
conjecture for Hs{^^){x) up to an additive error then tends to as n oo. However, (a) this stronger conjecture is 
not in explicit fonn, and (b) it does not only depend on d^, d^x, but also on various other parameters. In any case, 
even a more accurate prediction would not yield a better constant than § In 2 in Theorem \l.l\ 



Remark 5.4 In the present framework, the notion of balanced satisfying assignments from ^ simply corresponds to 
working with the constant map ptai ^ Z — >■ [0, 1] , z i— > i. This hightlights that the improvement that we obtain here 
stems from choosing the non-constant map pbp inspired by Belief Propagation. 

Remark 5.5 The definition M9^ of the marginal of a set gives rise to a formal notion of 'symmetric problem '. Namely, 
we could call a (binary) random CSP symmetric if its set 5csp(^) of solutions is such that for each variable x w.h.p. 
we have px{Scsp{^)) = 5 + Clearly, k-NAESAT passes this test as ^x{S^A'e{^)) = ^ for all x with certainty. 
Similarly, the problem of having a balanced satisfying assignment is symmetric as is random k-XORSAT. 

From here on out we keep the assumptions of Theorem l5.11 In particular, we assume rbai < r < rep. Let d 
be chosen from D, and let m be chosen from Md- Let p = pbp be as in jlH and pd as in (US) . 



5.2 Typical degree sequences 

We need to collect a few basic properties of the sequence d chosen from D. Let us call a sequence d ~ ((i()ieL of 
non-negative integers such that X^ieL di = km a signed degree sequence. For a fc-CNF $ let d ($) ~ [di 
denote the vector whose entry di ($) is equal to the number of times that literal / occurrs in $. Then D = Dk{n, m) 
is just the distribution of the signed degree sequence 

The signature of a literal I E L with respect to a signed degree sequence d is the triple (sign(Z) , d^n,d^^n). We 
omit the reference to d if it is clear from the context. Let T = T{d) be the set of all possible signatures. For 
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each literal I we let T{1) denote its signature. Furthermore, for a signature = (sign(Z), , G T we let 

-^9 = (-sign(0,rf|;|,d^|;|). 

Let d be a signed degree sequence. A fc-CNF $ over V is drcompatible if — d. Thus, 



A * 



is a uniformly random d-compatible fc-CNF. 

In the sequel we are going to prove statements about the random formula $d for a "typical" signed degree sequence 
d. Formally, this means that we first choose d from the distribution D at random. Then, conditioning on d, we will 
study the random formula Thus, there are two levels of randomness: the distribution of d and then, given d, the 
choice of the random formula When referring to the random choice of d we use the notation [•], [•]. By 
contrast, if we choose $d randomly for d fixed, then we use P [•], E [•]. 

Lemma 5.6 1. Let £ be an event such that P G ~ o(l). Then w.h.p. a signed degree sequence d chosen 
from the distribution D is such that P [^d G f] = o(l). Conversely, if w.h.p. for a random d chosen from D 
we have P G 5] = o(l), then P [* G f] = o(l). 

2. For any random variable X > and any e > we have P^ [E > E / e] < e. 

Proof. The first claim follows from Markov's inequality as P G = E^ [P [€>d G £]]. The second claim follows 
from from Markov's inequality as well because E = E^ [E ^ 

Lemma 5.7 For d chosen from D the following statements hold w.h.p. 
^- T.xevi^'x - d^xY ^ km. 

3. Let Ai contain the n literals of largest degree. Then -j^^ d, 



0(2 



-fc/2\ 



km ^leM ' 

Proof. We use the following description of the distribution D. Let e — {ei)i,^L be a family of indepedent Po(fc7-/2) 

variables. Moreover, let £ be the event that X];eL ^ known that e given £ has distribution D. 
Furthermore, a simple calculation based on Stirling's formula yields 

P [£] = Q{ir^/^). (22) 

Let e; = min {e/, In^ n}. Employing Stirling's formula once more, we find that P [e/ 7^ e;] < n~^^ for all I G L. 
Hence, by the union bound. 



P [V/ G L : e/ = e,] > 1 - 



(23) 



Furthermore, as e^, e-,x are independent for any a; G we have 

E [(e, - e^xf] = 2Var(e,) = 2Var(e,) + 0{n-^) ^ kr + 0{n-^). (24) 
Because ei < In^ n and the random variables | (cx — ^^x)^}^^y are mutually independent, Azuma's inequality yields 

„l/3 



J2{ex-e^x? 



xev 

Combining (I22]i-(l25ll. we find 



^{dx - d^x)"^ - kr 
xev 



xev 



e-,x 



> n 



2/3 



> n 



3/4 



P 



< 



xev 
e(ni/2)P 



< 2exp 



km 



< n' 



-10 



(25) 





£ 







km 



xGV 

< o{l) + Q{n^/^)F 
= 0(1), 



> n' 



3/4 



{ex - e^x)"^ - E (eg; - e- 



xev 



x£V 



> n 



2/3 
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thereby proving the first claim. The second claim follows from the first by means of the Cauchy-Schwarz inequality: 
w.h.p. 



Tl ^ ^ 



kr. 



xev 



Finally, the third assertion is immediate from the second. 

For a set C L we let Vol(5) = Vold(5) = J2ies '^i- 

Lemma 5.8 Let d be chosen from D. Then w.h.p. the following is true. 

For any set S <Z L of literals we have Vol(S') < 1015*1 maxjfcr, ln(ri/|S'|)} . Furthermore, if 
\S\ > n2-0■8^ thenYo\{S) > ^\S\kr. 



□ 



(26) 



Proof. We use the alternative description of D from the proof of Lemma 15^ That is, e = {ei)ieL is a family of 
indepedent Po(/sr/2) variables, and £ is the event that X^zgl ^' ~ ^ ^ kr/2. For any fixed set S" C L the 

random variable Xs = X^zes has distribution Po(|5'|A) (because the sum of two independent Poisson variables is 
Poisson). Therefore, letting = lOlS"! maxjfcr, ln(n/|5|)}, we obtain from Stirling's formula 



p[^s>m] < o{^)v[Xs^\^i^]<o{^) 



/i! cxp(A) 



< 0(%/^) 



eA 



cxp(— A). 



(27) 



For 1 < s < 2n let Xs = Ss |s|=s '^Xs>ti- Then ( |27] ) yields 

< O(V^) (^^"^ • exp(-A - ^) < O(V^) 



EX., 



2cn 

s 



exp(— A ~ fi) — o{l/n^), 



because /i > 10s ln(n/s). Thus, the first claim follows from ( |22] | and the union bound. 

To prove the second claim, we use Lemma IJ!6] For S C Lwe let Ys be the total number of occurrences of literals 
from S in Then Ys has distribution Bin(fcm, |S'|/2n) with mean |5'|fcr/2. By the Chernoff bound. 



P[Ys < kr\S\/3] < cxp 



kr\S\ 



Hence, letting Ys = Es:|S|=s lys<fer|s|/3. we get from 



100 

for s > n2-"-«'= 



(28) 



E[Ys] < 



cxp 



krs 
100 



< exp 



s(2 + k) 



krs 
100 



o(n-2). 



Thus, by the union bound P [Vs > n2 ^■^'^ : Yg = O] = 1 — o(l/rt). Applying Lemma |52] completes the proof. □ 
For any t E T we let n{t) be the number of variables x E V such that Pd{x) — t. 

Lemma 5.9 Let d be chosen from D. Then w.h.p. for any type t E T we have 

n{t) > 2-3fc/4„. 

Proof. We use the alternative description of the distribution D from the proof of Lemma |5^ That is, let e = {ei)ie l 
be a family of indepedent Po(fcr/2) variables, and £ be the event that X^/ei ~ ^'^^ ^' ^ ^) 
denote the number of literals I such that sign(/) = s and e|;| — e^|/| = A. Since Var(ei) = kr/2 = rifc(fc2'^), for any 
s G {±1} and any A such that A^ < 100A:2'^ In k we have E [X (s, A)] > nk~'^ for some absolute constant c > 0. 
Furthermore, because the random variables (e;)/^^, are mutually independent, the Chernoff bound implies that 



X{s,A) < -nk" 



< exp(-0(n)) providedthat A2 < 100A;2'' lnfc. 



(29) 
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Similarly, if we let X'^ denote the number of literals I such that sign(Z) = s and |e|;| — > 100^2*' In fc, then 

E [X' (s)] > nk^'^ for some absolute constant c' and 



X' (s) < -nk- 



< exp(— r2(n)). 



Thus, the assertion follows by combining ( |22] |. ( |29] l and dSOl l. 

For each t E T we let 7r(t) denote the fraction of literal occurrences of p-type t, i.e., 

di 



(30) 

□ 



^ ' i-^ km 



For each £ G £ let 



7£ = -E[to*,(£)]. 



Lemma 5.10 Let d be chosen from D. Then w.h.p. 7^ ^ Y[j=i "'(^i)/'"" all i ~ {£1, . . . ,£k) G C. 

Proof. By the linearity of expectation, we just need to compute the probability that the first clause has type £. 
Since |T^^(t)| = ^{11) for all t <E T, the types of the k literals of are asymptotically independent. Thus, the 
assertion follows from the fact that 7r(£j) equals the marginal probability that a random lityal has type ij. □ 

Lemma 5.11 W.h.p. for d chosen from D we have P [V^ e £ : — 7£n| < n^/'^] = 1 — o(l). 

Proof. Fix a type £ = (^i, . . . ,£j). Because p is a feasible marginal, for any j G [k] there are Vl{n) literals I with 
p{l) = p{£j). Therefore, a straightforward calculation shows that 

P [^d,i has type £\^dM has type £] = P [^d,i has type £] ■ {I + 0{l/n)) for any i ^ h. 

Consequently, Va,r{m^^{£)) ~ E [m^^{£)] = 0{n). Hence, by Chebyshev's inequality 



|m*,(^)-E[m*,(^)]| > 



,2/3 



0(n-i/3)^o(l). 



Since |£| = 0(1) asn^ cx)by the construction of p, the assertion follows from dSTI i and the union bound. 



(31) 

□ 



6 The first moment 
6.1 Outline 

Let p > I hi 2 be such that r = 2*" In 2 - p. 
Proposition 6.1 W.h.p. d, m are such that 

E [^p,good(4'd,i 



exp 



n 



In 2 
~2~ 



Ofc(l) 



We begin by computing E [Zp{^d,m)]- By definition, any assignment that is pd-judicious has pd-marginals. Thus, 
let 'Hp{d) C {0, 1}^ denote the set of all assignments that have p^-marginals. Then by the linearity of expectation, 

E[Zp(*d.r„)] - P k e 5p(*d,r,^)] ■ (32) 

Hence, we need to compute \Hp{d) \ and the probability P [a E Sp{^d,m)] for any cr G Hp{d). Using basic properties 
of the entropy, we obtain 
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Lemma 6.2 Let = —z In z — (1 — z) ln(l — z) denote the entropy function. Then w.h.p. d is such that 

\n\Hp{d)\^n-J2x{pix)). 



Taylor expanding x{z) around z — 1/2 and plugging in the definition (I2TI 1 of p, we obtain that w.h.p. d is such 



that 



ihi|Hp(d)Hln2-^ + o.(2-'=). 



(33) 



As a next step, we compute the probability of cr e Sp{^d.m) for a G Hp{d) 
Lemma 6.3 W.h.p. d, m are such that for any a G Hp{d), 
1 



lnP[ae5p(*d,m)] 



, „ kin 2 „ 



In 2 



(34) 



Let us defer the proof of Lenima l63l which is the core of the first moment computation, for a Uttle while. Com- 
bining ([32ll-(|34ll. we see that w.h.p. over the choice of d, m we have 



hlE[Zp(*d,rri)] = ln|Hp(d)|+hiP[ae5p(*d,r„)] 



1-fe 



111 2 

P- — +Ok{l) 



(35) 



To obtain the expectation of Zp.good, we show the following. 
Lemma 6.4 W.h.p. over the choice ofd, m we have 

E [Zp,good(*d,r,^)] - E [Zpi^d.m.)] • 

The proof of Lemma 16741 is based on arguments developed in |[T| for analyzing the geometry of the set of satisfying 
assignments. Combining (35[ and Lemma |6T4l vields Proposition |6T| 

6.2 Proof of Lemma 16.31 

For a sequence m ~ (m{£))£^c of non-negative integers we let Tm denote the event that m#^(£) = m(£) for all 
i £ C. Let us call m feasible if Pm [Tm] > and \m{£) — jen\ < r?l'^ for all I ^ C. Let Z be the number of 
Pd-judicious satisfying assignments. 

Proposition 6.5 Let d he chosen from D. Then w.h.p. for any feasible m ~ {m{£))£^c the following statements hold. 
L We have 



In 2 , 

n k 

^ 2 



< -lnE[Z(*d.m)] <2-'^ 
n 



In 2 , 

P- — + k- 



2. For any t we have 



< 



l<£L:pa{l)=t 



2knnr{t) 
nit) 



3. For any a G {0, 1}^ with p-marginals we have 



km 
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The proof of Proposition |63] consists of two steps. We defer the proof of the following lemma to Section l631 
Lemma 6.6 With the assumptions of Proposition \63\ and with 6, 6' defined by 



'-^''-^ and 



— V(l - 2p{x)){d, ~ d^,) = -(1 + 5')2-^ 
km ^ — ' 



we have w.h.p. 



ilnE[Z(*d,rr.)] = 2-'= 
n 



In 2 
P TT 



O 



f k{5 + d') 



j +0(2-3fc/2). 



Proof of Proposition \6.5\ Let A = 100fc2'"' In k and let (S, 5' be as in Lemma |63] Using the alternative description of 
the distribution D from the proof of Lemma 15^ and applying Azuma's inequality, one can easily verify that w.h.p. 

^ l(d,_d_)2<A • {dx - d-.,f > (1 - ^ {^d, - d^.,f. (36) 
xev x^v 

Therefore, Lemma lS/Tl entails that w.h.p. 

k^ ) 9 / 

Consequently, w.h.p. we have 



km ^ V 2 / /fcm ^ 4'=+^ 4'=+i 

a;ey ^ ' xev 



5 = Ofc(fc-i2). (37) 
Similarly, invoking (l36T l once more, we see that w.h.p. 

whence 

6' = Ok{k-^^) (38) 
w.h.p. Thus, Proposition |6.5| is a direct consequence of Lemmas |5.9| and [631 and ( |37] l. (|38] l. □ 

6.3 Proof of Lemma 16.61 

We begin by determining the number a S {0, 1}^ with p-marginals. The following is an easy consequence of 
Lemma l6?2l 



Corollary 6.7 W.h.p. for d chosen from D we have 

^. In \Hip)\ = ln2 - - ^ (p{x) - J) + 0(2-^'^/^) 



n ^ V 2 

xev 
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Proof. This follows from Lemma l672l bv Taylor expanding x(-) around i. □ 

We need to compute the probability that an assignment a £ {0, 1}^ with p-marginals is a p-judicious satisfying 
assignment. To this end, we introduce a new probability space (f2, P). Let q = {qe,j)eec,je[k] be a matrix with entries 
in [0, 1]. The elements of our new probability space 17 are all 0/1 vectors 

{^i]{(^))iec.ie[m{e)].]<£[k]- 

The distribution P is such that the entries a-ij{£) are mutually independent, and for each £ = {£i,. . . ,£k) G C, 
i e [m{i)], j £ [k] we let crij{t) — Be(g^.j) be a Bernoulli random variable. (It may be helpful to think of crij{£) as 
the truth value of the jth literal of the ith clause of type £ in a random formula ^d.m ) 
For £ = (€i , . . . , €fc ) e £ let (£) be the event that 

maxCTi,(£) = 1 

ie[fc] 

(the intuition is that this corresponds to the event that the clause i of type £ is satisfied). Let S{£) = C\i£[m(i)] ^ii^) 
and S = Cliec ^i^)- Moreover, for j £ [k] let B{e, j) be the event that 

Let Bit) = rij^i B{£,j) and B = H^gc The connection between the probability space fi and Lemma 1631 is 

as follows. 

Lemma 6.8 Suppose that a £ {0, 1}^ has p-marginals. Let S{a) be the event that a is a satisfying assignment of 
^d,m and let B{a) be the event that a is pd-judicious. Then P [iS(cr)|yB(cr)] = P [5*1-6] . 

Proof. Note that in P [S{(t) \B{(j)] probability is taken over the choice of the random formula ^d,m, while in P [5*15] 
probability is taken over & chosen from the above distribution. Thus, we need to relate the two probability spaces. 
For any d-compatible formula $ £ Tm we can define a map 



«6£,iG[m(f)] jG[fe], 



by letting (T^ (£)|$ be the truth value of the jth literal of the ith clause of type £ in $ under a. In other words, o-j^ 
is the string of truth values that we get by "plugging the assignment a into Then a is judicious iff aq, £ B. 
Furthermore, <t is satisfying iff (t$ £ S. Finally, if a has p-marginals, then o-j.^.^ ^ becomes a random vector Given 
B{a) its distribution is identical to the conditional distribution of & given B. □ 

Corollary 6.9 With the notation of Lemma \6.8\ we have P [S{a) D B{(t)] = P [S\B] exp{o{n)). Moreover, for any a 
with p-marginals we have P [.B(ct)] = O 



(\r\-k\c\)/2\ 



n 

Proof. Since the total number |£| of clause types is bounded, the assertion follows from a repeated application of 
Lemma l4n (the local limit theorem). □ 

Thus, we have reduced the proof of Lemma l676l to the computation of P [S'ji?]. The benefit of the probability space D, 
is that 5", B can be decomposed easily into independent events. Indeed, for any I £ C and any i £ [m{t)\ we have 

k 

P[5,(£)] = l-[]l-go-> 

i=i 

because the o-y (£) are independent. Moreover, due to independence and because m is feasible, 

k 



i In P [S{£)] = ^ In P [5. {£)] ~ n In 



1 ni 
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and thus 



Similarly, 



-lnP[^]^^7,ln 



1 ni 



ilnP[B] = l^lnP[i3(£)] = l^^lnP[i3(£,j)] 

77 77 77 l—^ 



(39) 



(40) 



A further benefit of the space O is that we are free to choose the vector q as we please (subject only to the condition 
that P \B\ > 0). To facilitate the computation of P [5*1-8], we are going to choose q such that 



For if RT\ holds, then 



P [B\S] ^ exp(o(n)). 



P[^l^] = j7§-cxp(o(n)), 



(41) 



P[B] 



where P [S], P [B] can be calculated rather easily via ( [39] ) and ( l40b . Thus, as a next step we need to find q such 
that (HTt is true. To this end, we define 



1 - n;=i 1 - m 

Lemma 6.10 There exists q such that qe.j = ij for all £ = {£i, . . . , £k) G C, j G [k]. Furthermore, this q satisfies 



(42) 



',i ~ 2 



Proof. For any £, j we have 



dq, 



dqt,j 



l-(l-2g,,,)n,^,l~g£,i 



(43) 



Hence, for k large enough and 0.01 < qj < 0.99 for all j, the k x k matrix Dq is close to id. In particular, this is true 
for qj close to 1/2. Therefore, the assertion follows from the inverse function theorem (Lemma [4.4t . □ 



Corollary 6.11 With qfrom Lemma WT^ we have P [B\S] ~ 0(; 



,-k\C\/2\ ^ 



) = exp(o(n)) and thus ( 147 H . 



Proof. Equation (|42] | shows that for the vector q from Lemma 16.101 we have E [crij{£)\Si{l)] = Ij for all £,j, i. 
Therefore, a repeated application of Lemma |4. ll vields P [B\S] = 9(n^'^l'^l/^) = exp(o(n)). □ 

From this point on we fix q as in Lemma [6.12\ 

Lemma 6.12 Letting 

S = T^5I(l-2Pd(a^))(^--^--)' (44) 



xev 



we have i In P [S"] = - In 2 + 2-'' [p 



In 2 



A: In 2] - /cS In 2 + 0(2-3/^^/2). 
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Proof. Starting from (|39] l, we obtain 

-lnP[5] ^ ^7^11 



E 

tec 



1 - n 1 - -^^.^ 



A; \ /A; 



where we used the approximation ln(l + 2 + 0(i-3). Thus, Lemma ISTTOl yields 



■lnP[5] = -^7, 



lec 



tec 3 = 1 



Further, by ( |43T l 



-InPfS-] 
n 



. 4-^" - kr ■ A-*" + 0(2-3'=/2) _ ^ 



teT 



Now, 



E-w f*-^ 



El^(f(^-)-^) + T^(J-p(^; 



- E - d^,) ('p(x) - = -E/2 



km 



Hence, 



ilnP[5] = -^.4-^-fcr.4-^-r (^ii^j +6(2-3^/2) 

= -r • 2-'' -^-A-^ -kr- 4"*^ - fcrS2-'^ + 0(2-3^/2). 



Plugging in r = 2'° In 2 — p, we get 
1 



-lnP[S'] = -ln2 + 2 
n 



ln2 , , „ 

p fc In 2 

^ 2 



-/cEln2 + 0(2-3'^/2)^ 



as claimed. 



(45) 
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Lemma 6.13 We have \ InP {B\ = + Oi^'^^l'^). 

Proof. Due to ( l40l i we just need to estimate In P j)] for any £ = [li, . . . ,tk) G 'C and j e [fc] . By construction, 

P [B{t, t)] = P [Bin(m(£), g,,,) = Ijm(l)] . 
By Lemma lelOl we have £j - 2-''-i + 0(2-3fe/2) = i + 0(2"''/^). Hence, using Lemma 1431 we find 



lnP[B(£,j)] 



-(2 + 0(2-^/2)) fe,-£,r 

- (2 + 0(2-'^/2)j 1^2-^-1 ^ Q^2-3fc/2))2 = - Q + 0(2-'^/2)j 2 



Hence, ( |40l ) yields 



■InPfBl 



EE 



m(€) 



-2fc 



-kr ■ 



1 . 2-2fe ^ 0(2^5'^'/2) 



/cln2 



ofc- 



f + 0(2-3''^/2)^ 



as claimed. 



Remark 6.14 In the second moment calculation we will need to know that 



In I 1 - n 1 - 



which follows from f l45l ) one/ ( 
Corollary 6.15 Lef 5,5' > Q be such that 



E (pdi^^ - \ 



_ {l + 5)km 

~ 22fc+2 ' 

S = [\ + 5')2-^ 



Then with r = 2 ^ \n2 — c we have 

ln|H(p)| +lnP[^|B] 



In 2 



O 



with "Efrom 1 



Proof. By the above. 



-lnP[5] 
n 



■lnP[Bl 



-ln|H(p)| 



-ln2 + 2-'^ L-i^-fcln2) - fcS In 2 + 0(2-^^/2) 



-ln2 + 2-^ yp- — 
''^ + 0(2--/^), 



ln2\ M'ln2 



2^- 



1-2--E M 



Key 



2fc 



In 2 



0(2-3^/2)^ 



fcln2 (5fcln2 



2*:+! 2*^+1 



(46) 



□ 
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Summing up yields the result. 

Proof of Lemma |6!6] Lemma |63] is a direct consequence of Corollaries |6.7[|6^l6.1 l| and |6.15l 



□ 
□ 



6.4 Proof of Lemma 16.41 

Assume that m is feasible. Let Z denote the number of good p-satisfying assignments. 

Proposition 6.16 Let d be chosen from D and let m be chosen from Ma- Then E [Z{^d,m)] ~ E [Z(^>d -m,)] w.h.p. 

The proof of Proposition |6. 16| is based on three lemmas. 
Lemma 6.17 Let d be chosen from D and let m be chosen from Md- 

L Let £ be an event such that P G £] = o(l). Then P [^d,m ^ £] = o(l). 

2. For any random variable X >Q and any e > we have Pd,m [E [X(^'d)] > E / e] < e. 

Proof. This follows from a similar application of Markov's inequality as in the proof of Lemma |531 □ 
Lemma 6.18 With the assumptions of Proposition [67/61 the random variable 

Z' ^\{ae Si^d^m) : |{t e 5(*d.m) : dist{a,r) < 2-°-^9'=n}| < E [Z(*d,rr.)]} | 
satisfies E [Z'{^d,m)] ^ E [Z(*d.m)] w.h.p. 

The proof of Lemma l6.18l can be found in Section 1675] Moreover, in Section l631 we prove the following. 

Lemma 6.19 Suppose that r < 2^ In 2. Let ^ = k2~^/'^. Let Z" be the number of pairs {a, r) S such that 

dist{a,T) € [k2-'',l] \ 



Then E [Z"] = o(l). 

Finally, Proposition 16.161 follows immediately from Lemmas |6 . 1 71 16 . 1 8] and 16 . 1 9l 

6.5 Proof of Lemma 16.181 

Let $ be a fc-CNF and a E We say that a variable x is £^-rigid in ($, a) if for any t e with t{x) ^ a{x) 

we have dist(a-, r) > ^n. Let A = kr/{2^ - 1). 

Lemma 6.20 L The expected number of a E in which more than k^'^2~^n variables support at most 12 

clauses is < cxp(— nA:^/2'^)E 

2. The expected number of a E in which more than (1 + l/fc^)2~'^n variables support no clause at all is 

< exp(-n/(fc62'=))E|5(*)|. 

Proof. Fix an assignment a G {0, 1}^, say a = 1. Then the number of clauses supported by each x £ V is 
asymptotically Poisson with mean A. Let be the event that x supports no more than 12 clauses. Then 

P[£J < Ai2cxp(-A) < h^^2-\ 

The events {£x)xev are negatively correlated. Therefore, the total number X of variables x E V for which £x occurs 
is stochastically dominated by a binomial variable Bin(n, ^k^^2^'^). Hence, the first assertion follows from Chernoff 
bounds. 

With respect to the second assertion, let £'^ be the event that x supports no clause at all. Then P [£x] < cxp(— A). 
Using negative correlation and Chernoff bounds once more completes the proof. □ 

Let us call a set 5 C self-contained if each variable in S supports at least ten clauses that consist of variables in 
S only. There is a simple process that yields a (possibly empty) self-contained set S. 
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• For each variable x that supports at least one clause, choose such a clause Cx randomly. 



• Let R be the set of all variables that support at least 12 clauses. 

• While there is a variable x E R that supports fewer than ten clauses ^ Cx that consist of variables of R 
only, remove x from R. 

The clauses Cx will play a special role later. 

Lemma 6.21 The expected number of solutions a £ S{^) for which the above process yields a set R of size \R\ < 
(1 - fciV2''> « bounded by exp{-nk^ /2'')E 

Proof. Let a e {0, 1}^ be an assignment, say a = 1. Let Q be the set of all variables that support fewer than 12 
clauses. By Lemma [6.201 we may condition on |Q| < k^^2~^n. Assume that \R\ < (1 — fc^^/2'^)n. Then there exists 
a set 5 C F\ (RUQ) of size < S < k^^n/2^ such that each variable in S supports ten clauses that contain 

another variable from S U Q. With s ~ \S\/n the probability of this event is bounded by 



TO 

lOsn 



lOsn 



< [4c/fc2,,]i°^" 



Hence, the expected number of set S for which the aforementioned event occurs is bounded by 

[Aek^s] < • [Aek'sf] < exp(-sn), 

which implies the assertion. □ 
Let us call a variable x is attached if x supports a clause whose other fc — 1 variables belong to R. 

Corollary 6.22 The expected number of a G S{^) in which more than n/{k'^2^) variables x ^ R that support at 
least one clause are not attached is bounded by E |5($)| • exp(— n/(fc^2'°)). 

Proof. Let F = V \ R. By Proposition l6.21l we may assume that |F| < nk^^ /2^. Therefore, for each of the "special" 
clause Cx that we reserved for each x that supports at least one clause the probability of containing a variable from 
F \ {x} is bounded by 

Furthermore, these events are independent (because the clauses Cx were disregarded in the construction of R). Hence, 
the number of variables x ^ R that support at least one clause but that are not attached is dominated by Bin(|F|, ^^). 
The assertion thus follows from Chernoff bounds. □ 

Let us call S C V dense if each variable in 5* supports at least ten clauses and at most 2k clauses such that at least 
ten of them feature another variable from S. 

Lemma 6.23 For d chosen from D, m chosen from Md and any a G {0, 1}^ the following holds w.h.p. Let A be 
the event that a is a p-satisfying assignment of^d,m- Then 

P [^d,m has a dense S dV, \S\ < n2-°-^'^^ \ A] = o(l). 



Proof. We may assume that d satisfies (l26l i: we emphasize that this is a property of d only, regardless of m or the 
event A. Let T){S) be the event that S* C is dense. We may fix (i.e., condition on) the specific clauses supported by 
each variable x £ S. Let x £ S and let i £ [m] be the index of a clause supported by x. Let i be the type of clause i. 
For each t e T let Vt be the set of literals I of type t. Then the probability that clause i contains another variable from 
S is bounded by 

Vol(F,^ n (7-1(0)) - '^er[ Yo\{Vtna-\0)) 
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Since \Vt\ > n2-"-^'= for all t w.h.p. by Lemma |5^ we have Vol(Vt) > lkr\Vt\ > 30 • 2°-2'=n. Furthermore, 
YoliVt n cr-i(O)) > iVol(yt) by the choice of p(t). Hence, ^ yields 

Vol(T4 n5'ncr-i(0)) Vol(5') maxjfcr, \n{n/\S\)} \S\ 

Voi(Ft nfj-i(o)) - ivoi(Ft) - 20-2'^7i ■ 

Due to negative correlation, in total we obtain 

(The factor (^p) accounts for the number of ways to choose 10 out of the at most 2k clauses that each variable in S 
supports.) 

For < s < 1 /k^ let Xg be the number of sets S of size \S\ — sn for which ViS) occurs. Then 



sn 



c (fc max {fcr, — ln(s)} s 



10 



-[ sn 



4k 

There are two cases to consider. First, if s < ln(n)/ri, then the term in the brackets is clearly o(l). Second, if 
s > ln{n)/7i, then we have the following bound. Since s < Smax = 2^°-^^'^' and as a; 1— >■ ln^° x is monotonically 
increasing for x < 0.1, we have 

Hence, the entire bracket is bounded by 2~^l'^ . Summing over aU possible s and using Markov's inequality completes 
the proof. □ 

Let us call a variable x (,-rigid in tj G S{^) if for any r S 5(<I>) with t{x) (j{x) we have dist((T, t) > ^n. 

Corollary 6.24 W.h.p. for d chosen from D and for m chosen from Md the following is true. Let g e {0, 1}^ and 

let A be the event that a is a p-satisfying assignment of^d.m- Moreover, let Y be the number of variables that are 
not 2~°-^'^'' -rigid. Then 

P [Y{^d^m) < (1 + 2fc-2)2-fe„ I ^] = 1 - 0(1). 

Proof. Let ^ = 2^°^^'"'. We condition on the event A. Consider a variable z that is either attached or in R. Let 
T g S{^d,m) be such that t(z) 7^ cr{x) and dist((T, r) < n/2^-^'^^. Because z is attached or in R, the set 

A = {.T e i? : t{x) ^ a{x)} 

is non-empty. Moreover, A is dense by the construction of R. Thus, Lemma 16.231 shows that dist(cr, r) > |A| > 
j^y'20.99fc jj p Hence, w.h.p. all z that are either attached or in R are ^-rigid. 
Further, let TZ be the event that 

• no more than (1 + \/k'^)2~^n variables support no clause at all and 

• at most n/ {k'^2^) variables x ^ R that support at least one clause are not attached 
Then Lemma 16.201 and Corollarv l6.22l implv together with Proposition l6.5l that 

P [^d,m en\A] = l- 0(1). 

Hence, the total number of vertices that either do not support a clause or that are not attached is bounded by (1 + 

2/k'^)2-^n □ 
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Proof of Lemma 167/51 Suppose that r = 2^^ In 2 — c. By Proposition l6.5l we have 

-lnE[Z(*d.m)] > 2-^- fc- i^+Ofc(l) 
n \ 2 

w.h.p. Now, assume that in cr G S{^d.m) all but at most (1 + 2fc~^)2~'°n variables are ^-rigid with ^ ~ 2~^-^'^^ . If 
r e is such that dist((T, r) < ^ri, then cr, t agree on all ^-rigid variables of a. Hence, 

-ln{Te5(*d^) :dist(cr,T) < ^n} < (1 + 2^-2)2-*^ = (1 + Ofc(l))2"'^ hi 2. 
n 

As c — ^ + Ofe(l) > (1 + Ofc(l)) hi 2 for c > | ln2 + e and k large enough, the assertion follows. □ 

6.6 Proof of Lemma 16.191 

By Markov's inequality, it suffices to bound the expected number of paris (tr, r) e S{^) at the given Hamming 
distances. More precisely, let be the number of pairs (cr, r) e such that dist(cr, T)/n = x. Let h{x) = 

—a; In a; — {\ — x) ln(l — .t) and set 

q{x) = r • In (l - 2^-^ + 2-^{l - a;)''') . 

Then 

-lnE[Z^] < hi2 + /i(a;) + (47) 
n 

We consider several cases. 



2*^ In 2 - c] 



Case 1: k2-^ < x 


< {2k)-K 


We have 




h{x) + q[x) 


+ ln2 < 


ln2 + x{l - Ina;) ^ 


-r{-2^''' + 2~''{l-x)'') 




< 


In 2 + x{l - Inx) ^ 


-2''\n2{-2^-'' + 2-''{l-x)'') +c2^~'' [as r 




< 


x{l - In a;) - In 2 4 


- (1 - .x)*=ln2 




< 


x{l - In a;) - In 2 4 


- {I - kx + k'^x^)ln2 




< 


x{l — Ina;) — fca; + 


k^x^ = a; [l — Ilia: — fc + /c^a;] . 


If k2-^ < X 


< k ^, then 1 — \nx — k + k^ 


a;<l — lnA: + l<0. Moreover, if k~^ < x < 


1 — Inx — k 


+ k'^x <l 


+ 21nfc- ffc < 0. 





Case 2: (2fc)-i <x < 0.01. We have 



h{x) + q{x)+\n2 < In 2 + a;(l - In a:) + r (-2^"''' + 2"'''(1 ^ 2;)'^') 

r r 

< In 2 + a:(l — hi a:) — ^k-i ^ 2^ cxp(— fcx) 

< a;(l - Ina;) - ln2 + + cxp(-A:x) ln2 

< a;(l - Ina:) + + (exp(-l/2) - l)ln2 



The last expression is negative for x < 0.05 (and k not too small). 
Case 3: 0.01 < x- < i - fc2-'^/2. We have 

h' (x) = — hix + ln(l — a;), 



q ix) = -771 17 7T TT > ^71 k = CXp -17 A: ). 

w 2*^ - 2 + 1 - a; - 2^-2 



23 



Hence, for 0.01 < x < ^ — k ^we have h'{x) + q'{x) > 0. Thus, h{x) + q{x) + hi 2 is monotonically 
increasing in this interval. Now, let a; = ^ — e for fc~^ < £ < k2~^/'^. Then 



h{x) 
q{x) 



In2-2e2 + 0(e3) 
(2'''ln2-c) 1^-2'"" + 2 
-2 In 2 + 2^-^ (c + In 2) + 0(4"'') 



2 ■^^--e 



-0(8 



Consequently, 



h{x)+q{x)+\n2 = -2e^ + 0{e^) + 0{2-'') < 0. 



Case 4: ^ + k2 ''/^ < x < 1. The function h{x) satisfies h{l — y) = h{y) for < y < 1/2. Furthermore, q{x) is 
monotonically decreasing. Therefore, for any a; > ^ + k2^^l'^ we have 



hi2 + h{x) + (7(.t) < ln2 + ( 2 ^ 



-fc 



q\--k2 



-fe 



< 0. 



In each case we have In 2 + h{x) + q(x) < 0. Thus, the assertion follows from (|47] | and Markov's inequality. 

7 The second moment 

Throughout this section we assume that r — 2^^ ln2 — p with p ^ | ln2 — Skfor some sequence Sk ~ Ofe(l) that 
tends to sufficiently slowly. We also assume that k > ko for a large enough constant ko > 3. We let d denote a 
signed degree sequence d chosen from D and we let m denote a vector chosen from Md- By Lemma 15.771 we may 
assume that |m(£) — 7^n| < v?/^ for all I. Let cr,T^ {0, 1}^ denote a pair of assignments chosen uniformly and 
independently from the set of all assignments with p-marginals. Finally, let ^ = k2~^/^. 



7.1 Outline 



The overlap of two assignments cr, r € {0, 1}^ is the vector 

1 



km'K{t) 



E 



lcr(;)=i ■ l-r(;)=i 



l<£L:pd{l)=t 



teT 



In words, 0{a, t) captures the fraction of occurrences of literals of each type t that are true under both a, t. Since 
cr, T are independent and have p-marginals, we have 



ter ■ 



Let Z" be the number of pairs {a, t) of p^-judicious satisfying assignments of ^d.m such that 



dist(o', t) e 



- - k'2-^'\ - + k'2-^'^ 
2 '2 



(48) 



Moreover, let Z' be the number of pairs (cr, r) of pd-judicious satisfying assignments of ^d,m such that 

\\0{a,T)-0*\\^<t 
Proposition 7.1 W.h.p. d, m are such that E [Z"(*d.m)] < E [Z'(*d,m,)] + o(l). 
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The proof of Proposition l7.1l can be found in Section rTSl Let Z denote the number of p-satisfying assignments of 
Furthermore, let Z signify the number of good p-satisfying assignments of In Section[8]we are going to establish 
the following. 

Proposition 7.2 V^.h.p. d, m are such that E [Z'(*d,m)] < C • E [Z{^d,m)f ■ 
Corollary 7.3 W.h.p. d, m are such that E [Z"^ d^m)] < C • E [Z{^d,rrt)f ■ 
Proof. Let Y be the number of pairs (cr, r) of good p-satisfying assignments of ^d,m such that 



2 '2 



(49) 



dist((T, r) ^ 

By definition, for any good a there are at most E [Z{^d.m.)] p-satisfying r such that ( |49] l holds. Therefore, 

E [r(*d,m)] < E [Z{^d,m)f . (50) 

Combining (|50] | with Proposition !?. Il and l7.2l we obtain for d chosen from D w.h.p. 

E[Z\^d.rr^)] < E [(F + Z") (*d,rr^ )] 

< E[{Y + Z'){^d,rr.)]+0{l)<{C+l)E[Z{^d,mt+o{l). (51) 

By Proposition 16.51 we have E ~ cxp(il(n)). Furthermore, Proposition 16.161 yields E [Z($d,Tra)] ^ 

E [Z{^d,m)]. Consequently, (EB implies E [Z^{^d.m)] < (C + 2)E [Z(*d,rr^)]^ as desired. □ 
The second part of Theorem l5.1l follows directly from Corollary 17. 3 1 

7.2 Proof of Proposition IH] 

We begin by relating the overlap to the Hamming distance. 

Lemma 7.4 W.h.p. d, m are such that for all pairs a,T £ {0, 1}^ satisfying A48i we have 



km 



Proof. By Lemma |5^ w.h.p. 



O = 



x<£V 



as claimed. □ 
Lemma 7.5 W.h.p. d, m are such that for any cr, r G {0, 1}^ that satisfy Il48i and that have p-marginals we have 

ilnPf(T,T e 5(*d)] < -21n2 + 0(fc2-'^). 
n 



25 



Proof. Much as in the first moment calculation in Section l631 here it is convenient to work with a different probability 
space. Namely, we let Cl be the set of all vectors (o'ij, )i£[„j] of 0/1 pairs. We define a probability distribution 
on O in which the pairs [aij , )ig [,„] [j.] are mutually independent random variables. For any i e [m] , j G [k] we 
let P [{d'ijjTij) = (a, b)] = q'^'^, where the parameters q"'' are chosen so that the following equations hold: 



= Oia,T), 
1 ^ 



km 
leL 



lb 

I 

,,fc=0 



Let (o-, t) denote a random pair chosen from this distribution. 
Proposition l6.5l and Lemma ensure that w.h.p. d is such that 



= i + 6(2-^-/'), (7"+gi" = 1 + 0(2^'^). (52) 

Thus, we may assume that ( |52] | holds. 
Let i? be the event that 

In addition, let S be the event that maxjg[fc] o-y = maxjgjfej Xy for all i £ [in]. We claim that 

P[<j,r eSi^d)]=P[S\B]. (53) 

Indeed, any d-compatible formula $ induces a pair ((t|$,t|<[,) G O defined by (7ij|$ = cr($ij), fy|$ = T($ij). 
Clearly, the distribution of the random pair ((t|,^^, t|,^^) is identical to the distribution of {&, t) given B. 

Due to independence, the probability of the event 5 is easy to compute. Indeed, with q = q^'^ + q^^ inclu- 
sion/exclusion yields 

F[S] = [l-2g^' + (l-2g + 
Furthermore, P [B] ~ cxp{o{n)) by the local limit theorem for the multinomial distribution. Hence, (l53T l yields 

- \nP [a, T eS(^d)] = -InPfS-lS] <o(l) + -lnl^ = o(l) + ilnP[S'] 
n ' n n p[B] n 

- rln [1 - 2q'' + (1 - 2q + q^^)''] < -r [2q'' - (1 - 2q + q^^)''] . 
Using ( |52] ) and simplifying completes the proof. □ 

Lemma 7.6 Let X > 2^*^ and t E T- For d chosen from D the following is true w.h.p. Let 7i" be the set of all pairs 
(y,T e {0, 1}^ such that |Of (cr, r) - 1/4| > A. Then 

|H"|<4"exp[-^ 



Proof. Let o-",t" G {0,1} be chosen uniformly and independently. Then E [©((cr", r")] = j. Furthermore,© 
satisfies the following Lipschitz condition. 
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lf(7',T",a",T" e {0,1} are such that there is a literaUo with T(/o) = i such that cr"(0 = (t'{1),t"{1) 
t'{1) for all I ^ {/o, ^^o}, then 



\Ot{cT"y')-Ot{a'y)\< 



2di, 



kmTr{t) 



Therefore, by Azuma's inequality for any A > we have 



> A 



< exp 



A^(fcm7rft))^ 



< 



exp 



X^njt) 
18~ 



where the last step follows from part 2 of Proposition|i 
Proof of Proposition ]?.!] Let H" be the set of pairs (cr, t) such that 

• cr, T satisfy ( |48] l and have p-marginals, and 

. \\Oia,r)-0*\\^>C 

Then by Lemma iTiSl and the second part of Proposition 16 . 5 1 w. h . p . (over the choice of d) we have 



\H"\ < 4" exp 



enit) 
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< 4" exp 



fc2 



36-2* 



Furthermore, by Lemma 173] w.h.p. (again over the choice of d) we have 

-0{k) 



P [cr, r e < 4-" exp 



2k 



for any (cr, r) e H" . 



Combining (|54] | and i55[ . we obtain that w.h.p. d is such that 



E[(Z"-Z')(*d)] < J2 PK^e5(*d)] < |i/"|4-"cxp 

(cr,r)eH" 



0(fc) 



2fc 



0(1). 



Therefore, the definition of the distribution Md entails that w.h.p. d is such that 

[E [{Z" - Z'){^d.m)]] = E [{Z" - Z')(*d)] = 0(1). 
Thus, the assertion follows from Markov's inequality. 



(54) 



(55) 



8 Proof of Proposition UJl 

We keep the notation and the assumptions of Section^ 

8.1 Overview 

For two assignments cr, r and a formula $ with signed degree distribution d we define a matrix 

w(cr,T,<I>) = (w£,j(cr,T,$))£g£_jg[fc] 

by letting j(cr, r, <!>) be equal to the fraction of clauses of type t whose jth literal is true under both cr, r. We 
call W() j(cr, T, $) the overlap matrix of cr, r in $. Recalling that cr, r denote two independent uniformly distributed 
assignments with p-marginals, we define ui ~ i^{cr, t, ^d.m)\ thus, a; is a random matrix. We use the symbol uj to 
denote (fixed, non-random) matrices u = {i^t.j)i(£c.jelk] with entries in [0, 1]. Furthermore, we let oj^ = {^e,j)je[k] 
denote the £-row of such a matrix uj. Finally, let u* = (w^ ^ ) be the matrix with entries ^ = £| for all i, j. 
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In addition, let S{i) be the event that both cr, r satisfy all clauses of type £ of $d,m- Let S = H^gc '^(^)- Further, 
let B{£, i) be the event that under both 



i.e., the fraction of clauses of type £ whose jth literal is true equals £j + 0{l/n). Let 

fl Bi£,j). 

eec.jelk] 

In Section |9] we are going to prove the following. 

Proposition 8.1 W.h.p. d, m are such that the following holds. Let C <Z C be a set of clause types and let S' 
1. For all oj — {iOij) such that \ujej — ujg A < k^^^ for all £ £ C , j d [k] we have the bound 



P[S'\uj = uj, B] < P[S'\u: = uj*,B]c^p 



di4-~'')Y,m{£)\\u^e^uj*A\l 



2. We have 



P[S\uj=uj*,B] < P[S'\uj = LJ*, B]cxp 



3. For any assignment a with p-marginals we have 



P [S\uj = uj* , B] < 0(1) • P [cr e Sp{^d,m)\<7 is p-judicious] 



For ll> = i(-0£j) define 0{u!) e [0, 1] by letting 



eec je[fc]:fj=t 



m{£)ujgj 
krm:{t) 



We also let w denote the matrix with entries luij ~ Oi (a;) for all £, j. We say that uj is compatible with O E [0,1 
if O = 0{ll!). In Section l872l we are going to prove the following. 

Proposition 8.2 W.h.p. d has the following property. For any uj — {(^e.j) such that 1 1 (cj) ~ -j 1 1 1 < 2^ we have 



T 



P[uj = oj\0(lj) = 0{oj),B] < P[uj = oj\0{Lj) = 0{oj),B]exp 



-fifc (1) • E W^^ ^ 



Recall that O* — {t^)teT- In Section [83] we will prove the following. 

Corollary 8.3 W.h.p. d, m are such that the following holds. For any O — {Ot)teT such that H© — < 2^ we 

have 



P[S\0{(t,t):^0,B\ < 0(1) •P[5|a; = cj*, S]exp 



d{2-'^)Y,n{t){Ot-o;r 

teT 
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In Section lOl we will show the following. 



Proposition 8.4 There exists a constant rj > such that w.h.p. d, m are such that the following holds. For all O with 
||0-il|l <2fwehave 

P[B\0{(t,t)^0] < i]-P[B\0{(7,t) =0*]. 
Furthermore, P [B\0{ct,t) = O*] = e(nl'^l-'^l^l). 

Recall that n{t) is the number of variables of type t G T. In Section[TO]we are going to prove the following. 

Proposition 8.5 W.h.p. d, m are such that the following holds. For all vectors A = {^t)teT '^'^'^ ll'^lloo — 1/8 we 
have 



P[V<er : |Ot(cr,r)-0;| > At] < exp 



-n-rifc(l)^7r(0A 



tGT 



Proof of Proposition^^ Suppose that O G [0,1] satisfies HO - 0*|1^ < ^. By PropositionlOw.h.p. 

V[S,B\0{<t,t) = 0] = P[S\B,0{(7,T) = 0]P[B\0{a,T) = 0] 

< j]-P [S\B, 0{(7, t) = 0]P [B\0{(t, t) = O*] . 



Furthermore, by Corollary [83] w.h.p 



P[S\0{(t,t) = 0,B] < 0{1) -PiSlLO = u*,B]exp 



Combining (|56] | and (|57] |. we see that 

P [S, B\0{(7, r) = O] < 0(1) • P [S\B, lj = uj*]-P [B\0{(t, t) = O* 



■ exp 



ter 



For an assignment a with p-marginals let 

b = P [a is p-judicious in ^d,m,] , s = P [a E Sp{^d,m)W p-judicious in $d.m] 
Then by part 3 of Proposition [STl Corollarv l8.4l and Corollarv l6.9l we have 

P [S\B, u; = a;*] • P [B\0{a, t) = O*] < 0(1) • (bs)^. 

Therefore, dSST l yields 



P[S,B\0{ct,t) ^O] < 0{l)-{bsfexp 
For a vector A = {Xt)t£T let 



ter 



(56) 



(57) 



(58) 



(59) 



h{x) = p[yteT ■.\Ot{cT,T)-o;\>Xt]. 

Moreover, for c ~ c{k) > a sufficiently large number let A ~ ^Z>q be the positive T-dimensional grid scaled 
by a factor of c/ y^- In addition, let h be the number of assignments a with p-marginals. Then by Proposition 18.51 
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and i5% there is a number C = > such that 



< 0(1) 



E [Z'i^d, 



< 



< 



< 



Oil) ■ J2 



<0(l)-^/i(A)cxp 

AgA 

riO(2-'^')^7r(t)A 



(6/is)2 
0(l)-^/i(A)exp 

^7r(t)A? 0(2-^-)-^^fc(l) 



iO(2-^)^7r(t)(A,+c/V^)2 



teT 



AeA 
0(1) • ^ exp 



AeA 



ter 



< 0(1)- ^ cxp 



17fc(l)^7r(OA? 

0(1) 



<0(l)-^cxp[-Cn||A| 



A6A 



Vll^!l2 



^exp [-Cc^z^] 



,2 = 



0(1) 



as desired. 



□ 



Notation for the proofs of Propositions 18. H48l4l It will be convenient to work with a different probability space. 
Namely, let ft be the set of all pairs {a, f ) of 0/1 vectors 



(CT,f) = {crij{£),nj{i))e^c,ielm{i)],je[k]- 



Let Bij c be the event that 



-L. ^ ,,^.(,).,^. and -L^ ^ .,(.).£, 

for all £ e C,j e [k]. Let = Cljelk] ^i.J ^"^^ ^^"^ ^ = Cleec 

To define a measure P on fJ, let q = (9" j)a,6e{o,i},fe£,jG[fe] be a vector with entries in [0, 1] such that 

1 

E„ab „ 1 01 _ 10 

9e,j - -L, lej - 1e,j 



a,b=0 



for all £,j. Define 
so that 



We define a measure P = Pg on as follows. 



1 - 2qi,j 



(60) 

(61) 
(62) 



For any £ = {£1, . . . ,£k) £ C, i G [m-(^)] and j g [k] independently we choose a pair of values 
{&,j{£),T,j{£)) e {0, 1}' such that 



for any a,b E {0, 1}. 
This probability space induces a random matrix d) = {ujij)ej with entries 

We will use the probability space (Ct, P) several times in the proof of the various propositions below, with various 
choices of q. 
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8.2 Proof of Proposition O 

Consider any lo = {uje,j) such that ||C'(w) — -jlH^ < 2^. We use the probability space (fi,P) with the vector q 
defined by 

qll^ = UJij, qij = ij for all £, j; 
the remaining entries of q are determined by (I60ll-(l62]l. Then the following is immediate from the construction. 

Fact 8.6 We have P [uj ^ u}\0{u;) = 0{u}), B]=P[lj = ^[©(a)) = 0{u}), B] . 
Now, 

P[w = w, e)(w) = Ci(w),B] ^ P[t2; = w, B] 



P[0{u}) = 0{uj), B] P[0{u}) =0{uj), B] 

P [B\u; = w] 



< 0(1 



P [O(cD) = 0{lo), B] 

PiBluj^u] 



P [cl> = 

• P [u; - , 



P [0{u:) - OiQ), B] 

The last step follows from the local limit theorem for the multinomial distribution because 



E 



JG[m(£)] 



= E 



for all £, j. Hence, 



P[(:J = u}\0{i:J) =0{u}), B] ^ P[w = a;] 



P[u}^ui\0{uj)^0{uj), B] - P[a; = a;]' 

For each £ E C, j G [k] the sum J2ie[m{e)] has a binomial distribution Wm{m{£),uj£j). Furthermore, 

these random variables are mutually independent. Therefore, Chernoff bounds yield 

P [c2> = w 



< cxp 



E mi^K^ej - il'ejf 



P = cj] 
whence the assertion follows. 

8.3 Proof of Corollary B 

Let u! be an overlap matrix such that O = 0{uj). Let C = C'{u)) be the set of all £ E C such that \uj( j — 1/4| < ^ for 
all j e [k]. Let S' = fl^gc, S{£). Then 

P{uj) = P[S',iJ = uj\0{uj) = 0,B] 

= P[S'\iJ = uj,B]-P[u}=uj\0{ij)=0,B]. 

Let 

P P[S'\uj ^uj*, B]-P[lj =0j\O{uj) =0, B]; 
observe that P depends on O but not on the specific choice of uj. Then by Propositions |8.1| and |8.2| 



P{uj) < P • exp 

< P ■ exp 

< P ■ exp 



\uji - ui'^Wl - fifc(l) \\uJi - ujeWl 
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By the second part of Proposition [83 

1, P[S'\uj = uj*, B] 
— In ■ 



n — ' kn 



n P [S\u) = uj* , B' 
Hence, letting P^P [S\uj = w*, 6] • P [a; = u!\0{uj) = 0, B], we obtain 



k — ' n 



P{uj) < Pexp 



To proceed, we note that 



jG[fc] fe£ 

E E E - ^*)' • 1^.=* = fc"^ E - ^* 



Thus, 



P(w) < Pexp 



^0(2-^) ^ 7r(t)(0: - O,)^ - E II 



teT i^c 
Summing over all possible overlap matrices uj of assignments with p-marginals, we get 

P = P{uj) = P [S'\0{uj) ^0,B]>P [S\0{uj) = O, B] 



which we can bound by 



P < P • exp 



teT 



E 



UJ( - iOll 



eec 



P[S\u:=Lu*, B] exp 



■ E 



iec 



P[uj = u\0{uj) = 0,B] 



< 0{l) -PiSluj =uj*, B]cxp 



teT 



as desired. 



8.4 Proof of Proposition 

Let u) be such that O = C(w) and — < We are going to work with the probability space (57, P) 

defined by letting 

We claim that there exist numbers < < (independent of uj) such that w.h.p. d is such that 

Cfc < nP [Pf.j I w = w] < 4 for all £, j. (63) 
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Indeed, given a>£.j = cj^^ the total number of indices i G [m-(^)] suchthat {a- ij{£),Tij {£)) = (1,0) has distribution 



1 - 

Therefore, the probability that the total number of such i equals its expectation is in the interval [ck,in~-^/^ ,Ck.2n^^^^] 
for certain Ck.2 > Ck.i > 0. Furthermore, given this event, the number of i G [m{i)] such that {a-ij {£) , Tij {£)) ~ (0,1) 
has distribution 



Bin \^{l~£j)mi£), 

Once more, the conditional probability that this random variable equals its expectation lies in [cfe ^n"^/^, cj, 471,"^/^] 
for certain Cfe.4 > Ck^3 > 0. Hence, setting Ck = CksCk,3 and c'j. = Ck.2CkA, we obtain (63[ . 

Summing ( |63] ) over all (finitely many) possible w with P [uj ^ lj] > and 0{uj) = O and invoking Proposition l8.2l 
we find that w.h.p. over the choice of d, 

P [Bij\0{i:b) ^O] ^ P [B£j|c2> = w] P [c2> = 

< 0(1/??,)+ P [Bij\LJ = uj]P [u = oj] 

uj:\\LU—Gj\\^<.n^'^/^ 

< o(l/n) + 4/?i<24/n. 

A similar calculation shows P [B£ j\0{Lj) ^ O] > ^Ck/n. As c^, c'j, are independent of the specific vector O, the 
assertion follows. 



9 Proof of Proposition I8A 



We keep the notation and the assumptions of Section^ 

9.1 Outline 

In Section |9^ we will establish the following. 

Proposition 9.1 There exist C^-functions Ve{-) that range over matrices uj = i^£,j)eec.jelk] such that 

l|w£-w^*|loo < for all £eC' 

with the following properties. 
1. For all such w we have 



P[S'\lj =u}, B] =exp 



2. For each £, Ve is a function of the row uJ( only. 

We need to analyse the functions Vi from Proposition |9T| Crucially, a;* turns out to be a stationary point. 
Proposition 9.2 The differentials of the functions Vefrom Proposition \9. ll satisfv DVi (w^) ~ Ofor all £. 



The proof of Proposition 19.21 can be found in Section 19.31 Furthermore, in Section 19.41 we derive the following 
bound on the second derivatives of Vg. 
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Proposition 9.3 The functions Vefrom Proposition \9.1\ ha\'e the following property. For any j, j', i we have 

^^^^ <0(4-) 



on the entire domain ofVi- 

Corollary 9.4 For any to in the domain ofV we have 

Vi{uje) < V{ujI) + 0(4"'^') - loI\\1 . 
Proof. This follows directly from Propositions 19.21 and 19 . 3 1 and Taylor's formula. □ 
Finally, in Section l931 we will show of Proposition 18.11 follows from Proposition 19 . II and Corollarv l9.4l 

9.2 Proof of Proposition |M] 

To construct the functions Ve, we are going to work with the probability space (f2, P) from Section [H] once more; we 
are going to define the vector q that determines the measure P so as to facilitate the definition of Ve in due course. Fix 

uj = {^e,])eec,]e[k] such that \\oje - ujg\\^ < fc~^^ for all £ g Let B' = f]f^^, Bi. Further, for ^ e £ and j e [k] 
let Ci^j be the event that iJi^j = ujij. Let d = r\je[k] ^Lj ^"'^ ^' ^ Cltec ^'t Finally, let S" = fl^ec' ^i^)- 
The following two facts are direct consequences of the definition of P. 

Fact 9.5 Ifq is such that P [B' n C] > 0, then P [■\B' n C] is the uniform distribution over the set B' n C". 

Fact 9.6 Suppose that q is such that the conditional distribution P [-{B' D C] is uniform. Then P [S"|-B', C] = 
V[S'\uj^uj,B]. 

Thus, our goal is pick q such that P [S'\B' , C'] is easy to compute. Roughly speaking, we are going to accomplish 
this by choosing q so that P [B' , C'\S'] is as big as possible. To implement this, we first need to determine the 
unconditional probabilities P [5"], P [B\ C] as functions of q. 

Lemma 9.7 Suppose that q is such that qij g (0, l)for all £ £ j G [k]. Then 

k k 

P [Sm - 1 - 2 - g,,,) + - J + ) (64) 



for all I € C',i € [m{l)\, and 



n ^ n 

tec 



k k 



Proof. The first statement follows by inclusion/exclusion. The probability that maxjg [^j {£) = equals 11^=1 (1 ~ 
qi^j) as the components a-ij{£) are the results of independent Bc{qej) experiments. For the event maxjgjj,] Tij{£) — 
we get the exact same expression. Furthermore, the probability of maxj^j^.] ^ij{£) ~ maxjgjj.] fij{£) ~ equals 
J|^^j^(l — 2q(,j + q)^j). To see this, note that for each individual j we have 

P [&,j{£) - f = 0] = 1 - 2qej + ql) 

by inclusion/exclusion, and these events are independent for j E [k] . The second one is due to independence over £ 
and i. □ 
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Lemma 9.8 For any q and any ^, j we have 

P [Qj] = P [Bin(TO(£), g]i ) = w,,jm(€) + 0(1)] 
Furthermore, ifq^^{i,j) < 1 then 



(65) 



P[S,,,|Q,,] = e(n-i/2).p 

Proof. Recall that C( j is the event that 



Bin (1 - LOe,j)m{e)^ 



^ ^.,(£).f,,(£)=c.,,,m(^) + 0(l). 

ie[m(^)] 

By construction, the random variables arij{£) ■ Tij{£) are independent 'Bc{ql^j) variables, and thus their sum has 
distribution Bin(m(£), q}^j)- Hence we get (|65] |. 

Furthermore, once we condition on the event C^j, the remaining (1 — ujg^j)m{£) pairs {crij{£),Tij{£)) are chosen 
conditional on the outcome being different from (1, 1). Hence, by construction each such pair takes the value (0, 0) 

with probabiUty ^ independently (with the numerator resulting from (l62t). In effect, the probability that the 



total number of (0, 0)s equals m{£){l — 2£j + uji,j) is just 



Bin {l-LUe^j)Tni£), 



1 - 2qe,j + q^ 



11 



m{£){l-2£j+uJi,j) + 0{l) 



Now, given that both this event and C^j- occur, the remaining 2{£j — Ld)m{£) pairs {a-ij{£),Tij{£)) come up either 
(1, 0) or (0, 1) with probability 1/2. By Stirling's formula, the probabihty that both outcomes occur an equal number 
of times is 6(ri^^/^). 

Note that 



□ 



P[B',c'] = l[P[B{£)nc{£)]=l[l[P[B{t„£)nc{t„e)] 

iec iec'j=i 
because under P the components of the vector {a-ij {£), Tij {£))e^i.j are independent. 
Lemma 9.9 There exists a vector q such that 



1 - 2nLi(i - <ii,h) + nLi(i - ^n^h + ' 

'iM 

1 - 2nLi(i - 9^./0 + nLi(i - + <k) ' 



(66) 



(67) 
(68) 



for all £ e J G [k]. This vector q satisfies 

= £j - 2-^-^ + 0(2-3fc/2) a„^gii ^ ^^^^ ^ 0(2-'=^). 
Proof. This follows from applying the inverse function theorem in a similar way as in the proof of Lemma 15. 101 □ 
In the rest of this section, we fix q as in Lemma W^ 



35 



Lemma 9.10 Let 



In 



1 - 2 - <ll,j) + 11(1 - 2q,,, + ) 

3 = 1 J = l 



■E 



Furthermore, let 



Then 



ij{qlj,ujej) + (1 - oje.j)-4' 



1 - 2(?£,, + q',^^ 1 - + uje,j 



1 - uje,. 



P C] = exp [n7'(a;) + 0(1)] 

Proof. The choice of q ensures that for any i and j, 

,11 



E 



ie[m(f)] 



5' 



(69) 



u:i^,m{l)- (70) 



indeed, by ( l64l ) the denominator in the middle term equals the probability of the event Siiji). Furthermore, by con- 
struction for any have 



P = l,T,,(l) = 0, Sm = I 1 - 11(1 - qe,,) = {q,., - gll) 1 - 11(1 " q,/, 



As a consequence, ( |67] | ensures that 



E 



ie[m(«)] 



By inclusion/exclusion, we obtain from (iTOl i and (ItTI i that 



£jm(£). 



(71) 



E 



5' 



(1 - 2lj +ujf^j)m{e). 



J2 (i-o-«,w)-(i~t,,w) 

Je[m(^)l 

Due to ( fTOl i and ( f72] i. a repeated application of Lemma l4.1l (the local limit theorem) yields 

F[B',C'\S'] = e(n"3''l^'l/2). 



(72) 



(73) 



Invoking Lemma [978] and using the large deviations principle for the binomial distribution (Lemma |4.2| |. we can 
easily determine the unconditional probability of B' nC: we have 

P[B\C'] = l[^K,]P[B',JCl,] 



-k\C'\/2 



e(^-3/c|£'|/2) 



exp 



•P 



Bin (1 -uji,j)m{£). 



1 - 2q( j + ql 
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1 - 



m(£)(l - 2tj+uji,j) 



■0(9",, + (1 - WfjOV" 



1 - 2qt,j + ql\ 1 - 2lj + c^,,,- 



1-^li 



1 - W£ , 
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Thus, 



lnP[S"|S',C"] = In 



P[S']P[B\C'\S'] 



P [B', C] 
= 0(l)+lnP[5'] 



^,3 



1 - 



The assertion follows by plugging in the expression for P [S"] from Lemma □ 
Finally, Proposition 19.11 follows from Fact |9.6| and Lemma |9.10l 

9.3 Proof of Proposition |931 

We start with the following observation. 

Lemma 9.11 Let q be the solution to i67\l and i68i for lo = lo*. There is 7 = ^{k) > such that for any e > and 
any i ^ CJ we have 

P[||a>f > el^f, Bf] < exp(-7e2n + o(n)) and (74) 
P[||cl>,-cj;||^ >e|B,] < exp(-7e2„ + o(n)). (75) 

Proof. Equation ( |73] ) from the proof of Lemma |9. lOl shows that 

P[B,|5',] =exp(o(n)). (76) 

Therefore, it is going to be sufficient to estimate P — > e | 5"^] . If we just condition on the event S^, then 

the fc-tuples (o-jj (£), -r^j of 0/1 parrs are mutually independent for all i £ [m{£)]. Furthermore, given 5^ 

modifying just one such fc-tuple can alter any entry uj( j by at most c/n, for some number c = c{k) > 0. Therefore, 
Azuma's inequality yields 

P - E [^i,,] I > e\S(] < 2cxp(-7£2n), (77) 

for some 7 = '-f{k) > 0. Since ( |68] ) ensures that E [oi^jS'^] = ujf, ( |74l i follows from ( |76] ), ( iTTl i and the union bound. 
To obtain dTSl l. let q' be the vector with entries q'^ j = p{£j) for all £, j. Then 

Pq, [Be] = cxp(o(n)). (78) 
Furthermore, applying Azuma's inequality just as in the previous paragraph, we find that 

Pg, [\uje^j - E [ujej | > e] < 2 exp(-7e2n) (79) 

for some 7 = j{k) > 0. Moreover, Eg/ [u)i] = ujf by the choice of q'. Thus, ( fTSl ) follows from ( fTHI ). ( |79] l and the 
union bound. □ 

Proof of Proposition \9.2\ Let I E C. Let q be the solution to ( |67] | and ( l68T l for w = cj*. Then P [ • \B'] is the uniform 
distribution over pairs (ct, t) E il such that (ct, f) e B'. Indeed, for oj = ui* the solution q to (l67b and (|68] | satisfies 
J = J for all £, j. Therefore, for any {a, f) e 51 we have 



P la = a, T = t] 



Since the sums J2e i j '^i.j (^) + '^i-j (^) coincide for all ct, f G B', dSOl l shows that P [ • is uniform. 
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Let H{La) be the number of pairs (ct, f ) G such (ct, f) € i?' and a>((7, f ) = w. We claim that 

-Dh\H{uj*) = o{l). (81) 

71 

This can be verified either by representing H{ljj) as a product of binomial coefficients and applying Stirlings formula 
or, alternatively, by using (|75] |. Indeed, assume that dSTl i is false. Then for small enough e > there is (5 > such that 
for some cj' with — '-^*\\^ ^ £ we have 

\TLH{uj')>5n+ max \nH{uj) (82) 
(with both 5 possibly dependent on k but not on n). Letting 

{a,f)eB' 

we obtain from dTSl l that 

1 ^ P[||u;-w*||^ <e/2|B'] = i J2 l|l-(a.f)--|U<e/2-i^^(*.f) 

(CT,f)eB' 

= exp(o(n))- max H{uj)/H. (83) 

l^:||w-ii)*j[^<e/2 



However, combining ( |82] | and ( l83l l we get 

P[||'^-t^*lloo > £/2|-B'] > H{uj')/H >cxp{Sn) max H{uj)/H 

uj -.Wuj — uj* \ \ ^<e 1 2 

> exp((5n - o(n))P - < £/2\B'] > 1, 

which is a contradiction. Hence, (ISTl i follows. 

Now, assume for contradiction that DVi{io*) ^ 0. Because the function Vi{-) remains fixed as n — > oo, there 
exists a fixed e' > such that \\DV(,{ijJ*)\\^ > e' . Therefore, dSTT l entails that for any e > small enough exist uj' , 
(5 > such that — ||^ ~ e and 

\nH{ijj') +n-'Pi{uj') > Sn+ max \nH{uj) + n ■ Veiuj), (84) 
with e, 5 independent of n. Let 

Hi= H{Lj{a,f))exp[nViiLj{a,f))]. 

{a,f)eB' 



Then by ( l74l i. 



5Z l||i'(£r,f)-c^-|U<£/2-^cD(5,f)exp[n-Pf(t2;(CT,f)) + 0(l)] 
{a,f)eB' 

exp(o(n)) • max H{uj) exp{nVe{uj))/ Hg. (85) 



However, combining ( I84] | and ( I85b we get 

i?(w')exp [n-p<?(a;') + 0(1)] 



> exp((5n) max H{lu) exp{nVe{i-o))/ Hg 

cj:||tj — tJ*||^<£/2 



> exp((57i-o(n))P[||tI>-t^*||^ < e/2\Se,B'] > 1. 
This contradiction shows that DVeioJ*) = for all £. 
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9.4 Proof of Proposition 19.31 

We need to compute the second derivative of Vi- In particular, we also need to differentiate q = q(oj) the solution 
to (|67]|-(|68]|. Furthermore, we fix some type £ E C for the rest of this section. Let We denote the set of all vectors ujt 
such that \uJej — j\ < k^^ for all j e [fc]. In Section l931 we are going to establish the following. 



Lemma 9.12 On We we have 



3ui 



doj, 



'Li 



= lh=^ + 0(2 



1e,h 



'4Lh 



due 



d{2 



duje^idujej 



0(2-'=), 

6(2-''). 



for any i, j G [fc]. 

We split the function Vt into a sum of various contributions; let 

fe fc 



(j)t{q) = In 



V'«j(w,(?) 
Lemma 9.13 On We we have 



1 - 2 [](! - + 11(1 - 2qe,j + q]],) 
i=i i=i 

X! ^^.J + i'ej (w, q) with 



and 



(1 - Loe.j)^^ 



1 - 2qe^j + q}]. 1 - 2£, + uje ■ 



l-q} 



11 





1 - UJt, 



duie,h duje.j 



< 0(4-*^) forall h,j e [k]. 



Proof. By Lemma|9l]for all w e we have \qej - 1/2| < 1/fc^ and - 1/4| < 1/fc^ for all j e [fc-]. For such 
vectors we obtain the bounds 
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= 0(2-^), 



J ' dqe.jdq, 



-,11. 



30qe,h oqijuqe^h 
d(f>e d'^(j)e 
dq^^'dqp^ 



0(4-'=) 



for all i, j, /i e [fc]. Therefore, the assertion follows from Lemma [43] (the chain rule) and Lemma |9. 121 
Let £ > 0. We say that vj/ e O^((0, 1)^, R) is e-tame on J C (0, 1)^ if the following conditions hold: 

Tl. For all y e (0, 1) we have *(y, y) ^ 0. 



□ 



T2. On y we have 
T3. On y we have 
T4. On y we have 



Z^i=l 



dzidzj 



< e for any j = 1,2. 



dzidzj 



< 100 for any i,j = 1,2. 



Let / : (0,1)'' {zi,...,Zk) >-> (/i(zi, . . . , z^), /2(zi, . . . , z^)) be a O^-function. We say that / is 

e-benign on W if the following statements are true on W: 
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Bl. 
B2. 
B3. 
B4. 



dfi _ dh 
dzi dzi 



< e. 



an 



dzj 



< £ for any 1 < j <k and i = 1 , 2 and 
< £ for any i and 7^ (1, 1). 



dfi. 



dzi 



< 100. 



dzhdzj 



dz'f dz'f 



< eand ||^| < 100. 



Lemma 9.14 There is an absolute constant C > such that the following is true. Assume that f is £-benign on W 
and that 5" is £-tame on /(W). Then on W we have 



dzidzj 

Proof. By Lemma |431 (the chain rule), we have 



< Ce^ for any i,j € [k] 



dzidzj 



E 

h=l 



dyh dzidzj 



E 

a.fc=l 



J^dfadh 

dyadyb dzi dz-j 



Since by T4 and Taylor's formula we have = Ok{£), B3 implies that for (i, j) 7^ (1, 1) 



E 

h=l 



dyh dzidzj 



Furthermore, as = Ok{£), B4 yields 



E 



dyh dzi dzi dyh 



/i=i 



dzi dzi 



^ dyh 



h=l 



the last step follows from T2 and Taylor's formula. 

To deal with the second sum, we consider four cases. 

Case 1: i ^ l,j ^ 1. By B2 we have < Ofcle^), and thus 

J^dfadh ^ 
dyadyb dzi dzj 

byT4. 

Case 2: i — l,j =^ 1. We have 



E 

a.b=l 



d'^ dfgdfb 
dyadyb dzi dzj 



E dfb \ - 



fc=i 



dz 



B1,T4 



^ dyadyb dzi 
df 



= Eo.(e)E 



6=1 



a/g 



b=l 
2 2 



^1 



T2 



6=1 



Case 3: i ^ 1, j = 1. The same argument as in case 2 applies. 
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Case 4: i = j = 1. We have 



dfadft 



y 

ab^i ^y^-^yb 



^ ) X! 



E 



9^1/ /^^(^yadyb ^^duadyb \_dzi dzi \dzi 



dfadft fdf. 



B2,T3 



i,b=l 
2 



9/a 

dyadyb dzi 



dzi dzi 



E 

a, 6=1 



dyadyb dzi 



dfa^dh 
dzi dzi 



6=1 
2 



;s ^y-^yb 



Bl 



52* a/iBi 



6=1 
2 



^^/a^yb dzi 



dyadyb dzi 



T2 



dyadyb 



a=l b=l - -'- a = l b=l 

Hence, in all cases we obtain a bound of Ok{s^)- 

Lemma 9.15 The functions {yi, 7/2) '—^ 2/2) ond {yi, 2/2) (1 ^ yi)'4'{yi: 2/2) ore 0{2~'')-tame on 



y 



(yi,y2) e (0, If : I2/1 - y2| < fc'2-^ max |y, - 1/4| < 

z— 1,2 



Proof. It is straightforward to work out the differentials of ip: we have 

dip _y2 l-y2 dj' 
dyi yi 1 - yi ' dy2 
d^ip y2 1 - y2 d^ip 



dyi 



2/2 

9 

2/1 



2/1 
1 



(1-2/iF 



dyidy: 



1-2/2 
1-2/1 



yi 1 - yi ■ 



9^2^ 



y2 1 - y2 



Differentiating once more with respect to yi, we get 

_ 2y2 2(1 - y2) 9^^/' 

9y3 



2/1 



(1 - ' ay?9y2 
Therefore, at yi = 2/2 + e the second derivatives work out to be 

1 1 



1 1 d^ip 

yl (l-yi)^' ^yi^yl 



0. 



7^3(^2 +e,y2) = +2e 

c'yi y2 1 - y2 



dyidy2 
5V 



1 



1 



(y2 + e, y2) = — 



1 



y2 1 - y2 



+ e - 



1 

(i- y2)2 
1 1 



y2^ 



(1 - 2/2)^ 



1 



„ 2 (y2 + e, y2) = 

9y2^ 



1 



!/2- y2 1 - y2 

Hence, -0 is tame. Furthermore, differentiating (yi, y2) i-^- (1 — y2)ip{yi,y2) yields 



_9_ 

ayi 
d 

dy2 
dyi 



(1 - y2)V'(yi,y2) 
(1 - y2)V'(yi,y2) 



-h;7^(i - y2)0(yi,y2) 



9 

(1 - y2)^V'(yi,y2), 
oyi 

(1 - y2)^^/'(yi, y2) - V'(yi, y2), 

dy2 
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^(1 - y2)0(yi,y2) = 
12 

■(1 - y2)V'(yi,y2) = 



(1 - y2)Tr3V'(2/i,2/2), 
02/I 

(1 ^ 2/2)^-0(2/1,2/2) - 2— ?A(yi,y2), 



'9y2' 



dyidy: 
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Hence, the fact that (1 — y2)ip{yi ,2/2) is e-tame follows from the fact that ip is. 
Lemma 9.16 With q = q{(^) the functions 







^ {(,1,1,3X2,1,]) = 



1 - 



4 Moo — J 



are 0{2 '^)-benign on W ^ {c 
Proof. The fact that is benign follows directly from Lemma |9.12| With respect to we have 

dC2,i,j _ 2(1 - £j) d^C2,i,j 4(1 - £,) 



dujf 



2,i,j 



duji,h 
dCi,e,j 



duj 



l,h 



(l-a;£j)2' duj- (l-a;£j)3 



0, 
(1 



2,e,j 



dujijiduji. 



2 ^g^.j I 



(h^j), 

+ g7 (l-2g,,+g,"^-) _ 2(l-,,,- ) 



Tn^ + o(2-^), 



4(1 - 

(1 - Qe,jr 



= 0(2-'=), 



0(2- 



= d{2 



Since jg^j — £j| < 0(2 and jq]^- — oje,j\ < (5(2 by Lemma the assertion follows. 
Finally, Proposition|93]follows directly from Lemmas |9?T3l l97T4l I9J3] and l9?T6l 



□ 



9.5 Proof of Lemma Kl2\ 

Let 



q^ 



i.h) 



1 - 2nLi(i - ^u.h) + nti(i - 29^,/. + 

%i 

1 - 2 nLi(i - m,h) + nti(i - 2gf,/. + ' 



A straightforward calculation shows that for q such that — 1/2| < 1/fc^ and — 1/4| < 1/fc^ we have 



dqi,h 



1,=H + 0(2-^-), 



dPf, 



dqi,h 



0(2-'=) 



dq'' 



' -0(2-^), 

1,=/. + 0(2-^-) 



e.h 



for any h e \k]. Let F : q 1-^ il!}'"''/']]"^'''''')- Then the differential of F satisfies 



DF = 



\ 9qe,h 
\ dqe,h 



he[k] 



he[k] 



( dPt,j 



he[k] 



= id + 0(2-*^)l 



(86) 
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where id is the matrix with ones on the diagonal and zeros elsewhere, and 1 signifies the matrix with all entries equal 
to one. By the inverse function theorem, we have D{F~^) = {DF)~^. Furthermore, by (|86] | and Cramer's rule, 

(DF)-^ =id + 0(2-'=)l. 



(87) 



Since q{uj) is the solution to F{q) = (''(^^^.')^^^['*')' *l87] i yields the assertions on the first derivatives 
Lemma |9.12| 

Proceeding to the second derivative, we highlight the following (folklore) fact. 

Lemma 9.17 Let e,S = cxp(— r2(fc)). Let A be the set of all k x k matrices A = (A^ ) such that {An — 1| < e for all 
i and | Ay | < S for all i ^ j. Then A is regular and the operator inv : A G A H- A^^ = {iiiVstA)s^t=i,...,k satisfies 



dinvs 



da. 



<0{S) 



(1 + 0(e)) for anyi,j,s,te [k]. 



Proof. This is a simple consequence of Cramer's rule. Indeed, let A'^j be the matrix obtained from A by omitting row 
i and column j. Then 

Thus, we need to differentiate det A[^ and det A. For any i ^ j we have 

d 



dan 



■det A 



l[ahh + d{S) = l + die) + d{S), 



dai 



■det A = d{5). 



Similarly, for i ^ j and s twe have 

d 



— det a;, 

aau 



U^f{l + 0{e)), — detA;, = 0((5), 

da.,i 



dai 



■detA;, = 0(5). 



Thus, the assertion follows from the quotient rule. 

A direct calculation shows that for q such that \q(j — 1/2| < 1/fc^ and 



□ 



■,11 



1/4| < we have 



d^ , 

hi,hdqu ' dqi^hdqW ' dql\dq^^ 



d^n. 



d^n, 



le,hdqe,i ' dqe^hdq}\ ' dqj\dq_ 



d^n^^j 
11 a^ii 



0(2-'=) 



for any ft,, i,j £ [k]. Thus, 



\D^F\\ <0(2-'=). 



(88) 



Because by the chain rule Z?(inv o DF) = (Z3inv) o (D^F), the assertion on the second derivatives follows from 
Lemma |9l7l ^ and 



9.6 Completing the proof of Proposition 18.11 

The first assertion is a direct consequence of Proposition |9. H and Corollarv l9.4| Similarly, the second assertion follows 
from Proposition |9T| because VAuj) < — fifc(2^'=) for all £. 

Finally, let w = u*. It is straightforward to verify that by letting qg j be as in Lemma l6.10l and by setting = qj ^ 
we obtain the unique solution to (|67Ti-(|68Tl. We need to plug this solution into V{'jj)'. we have 

k k 



hi 



k k 

1 - 2 - g,,,) + - 29^. + 4],) 

j=i J=i 



= In 



l-2j](l-g,J + [](l 

j=i J=i 
fc 




(89) 
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Moreover, 



Further, 



V-fe" = V(<zi„^ ■) = -24ln ( J:^ ) - (1 - 4) In ' ^ 



In 



'J 
l-£ 



1 - 



1 



In 



J / 



,2^ , A - 2^^,^- + 1 - + Wfj- 



1 - Wfj- 



w 



^ -<'-'^>^'°(r^)-<'-^?>'°(TTt)-^w-y 

Summing up we find 



ln( 



nV(uj) 



Therefore, the third assertion follows from Remark |6.14| 



(90) 



(91) 



10 Enumeration of Assignments with p-Marginals 



In this section we will prove Lemma and Proposition l8.5l Before we present the actual details we will introduce an 
appropriate framework, which will enable us to perform the enumeration of assignments with p-marginals, and pairs 
of such assignments with a given overlap. 

In Section|5]we said that an assignment a G {0, 1}^ has pa-marginals if for any type t £ T we have 



leL:Til)=t 



p{t)n{t). 



In words, the fraction of literal occurrences of type t that are true under a equals p{t) up to an error of 0{l/n). 
However, due to technical reasons and because it simplifies some of our calculations significantly, we will actually 
work with a slightly refined definition. Let us say that a signature (s,(i+,(i~) is good, if d^,dr < 3fcr/4 and 
< (d+ - d)2 < 100^2'^ hi fc. Instead of requiring that the fraction of literal occurrences of type t equals p{t), we 
require that this is true for every good signature. That is, we say that an assignment a G {0, 1} has p^-marginals if 
for any good s G T 

l£L:T{l)=s leL:T{l)=s 

and moreover, that fraction of literal occurrences of all other variables is 1/2, i.e.. 



E 



La(/) = 1 



di 



E 



leL:p{l) = l/2 



di_ 
km 



leL:p{l) = l/2 

We are going to prove Lemma 16.21 and Proposition 18.51 with this modified definition. It is easily checked that this 
modification does not affect any of the arguments in the previous sections. 



44 



Let s e T be any signature and set Ls = {£ e L : T{£) = s}. Moreover, denote by Vs = {\£\ : £ € LJ 
and observe that Vs — V^s- For any a E {0, 1}" let us denote by the s-weight Ws(cr) the number of satisfied literal 
occurrences, where only literals of signature s are considered, i.e., 

■Ws{cr) = ^ lia[e)=i]de. 

Let us also define similar quantities with respect to the types. Let t G T and set, as previously, Lt ^ {£ E L : T{£) = 
t}. Denote by G T the type satisfying p{^t) = 1 — p{t). Note that -^t exists, and we have L^t ~ ■ £ G Lt}. 
Moreover, note that if p{t) ^ 1/2 we have Lt n L-,t = 0, and Lt — L-,t otherwise. Finally, set Vf = : £ ^ Lt} — 
{\£\ : £ £ L^t}- In accordance with the case of signatures, let us for any a G {0, 1}" denote by the f-weight Wt{<T) 
the number of satisfied literal occurrences, where only literals of type t are considered, i.e.. 

Let tii2 be the type such that ^(^1/2) = 1/2. Since Lt^^^ = L^ti/2 follows that in this special case 

'"^tl/2(''') = X/ '^l<T(v) = l]dv + 'i-lcr{v)=0]d^v (92) 

With the above notation, an assignment a has p-marginals if and only if 

'^s eT\ti/2 ■■ Ws{a) =p{s)TT{s)km and wt^^^icr) = ^TT{ti/2)km. 

The next proposition is the first step towards the estimation of the total number of assignments with p-marginals, c.f. 
Lemma l672l We denote by = — .t ln.T— (1 — a;) ln(l — x) the entropy of a;, and with [z"]/(z) the n-th coefficient 
in the Taylor series expansion of an analytic function / around 0. 

Proposition 10.1 W.h.p. d chosen from D has the following property. There is a constant C > such that if we 
denote by S the set of signatures s € T with the property p{s) > 1/2, then 



|H| = (C + o(l))n-l'5|/2 



Proof. First of all, note that if for an assignment a and a signature s E T withp(s) > 1/2 we have Ws{<j) = ■K{s)km, 
then the fraction of variables in Vg that are set to true is p{s). Thus, the fraction of variables set to false is 1 — p{s) = 
p{-^s), and we infer that 

w^s{(j)^ ^ l[a(i)=i]di ^ ^ l[^{^)=o]d^v = p{-^s)T:{-^s)km. 

Consequently, for any such s the number of partial assignments CTs : 14 -> {0, 1}, with the property that the fraction 
of satisfied variables is p{s) is 



P{s)m) ^2np{s){l - p{s))\Vs\ 



Since w.h.p. d is such that \Vs\ = (1 + o{l))asn for some = a.s{k), this provides the exponential terms in ( |93] ). 

It remains to bound the number of partial assignments a' : Vt^^^ — ?> {0, 1} such that Wtj/, = ^'^{ti/2)km. Define 
the generating function 

F{z) = z^^'^M^'^ 
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By definition, the sought quantity is [2;'^(*i/2)'=™/2j^^2:). Moreover, the definition of F{z) and ( |92] l imply that 

n n (ik'(.)=i]2''" +i[a'(-)=o]^''^") 

The assertion follows. □ 
Lemma l6^ follows immediately from the next statement, which is shown in Section fTO.ll 

Proposition 10.2 W.h.p. d chosen fmm D has the following property. There is a constant C ~ C{k) > such that is 
we write N = | Vt^^^ |, then 

We proceed with the proof of Proposition [831 i e-, we want to enumerate pairs of assignments with p-marginals that 
have a specific overlap. Let s G T be a signature. For any cr, r G {0, 1}" denote the by the s-overlap Os(cr, t) the 
number of literal occurrences that are satisfied in both a and t, where we consider only literals of signature s, i.e.. 

Osier, t) = ^ l[a{e)=T{l)=l]dl- 

Similarly, for any type t E T we denote by ot {a, r) the number of satisfied literal occurrences in both a and t, where 
only literals of type t are considered. Note that Ot{(J, t) = 0{a, T)tTr{t)km, where O is defined in Section [TTl For 
the special case t = ti/2 it follows 

0*1/2 ('''i ''') = X/ l[a(i;)=r(u) = l]C?t) + l[o-(^-u)=T(^t))=0]- (94) 
"5^*1/2 

Let us begin with a simple observation. Let s E t such that p{s) > 1/2, and let a, r be two assignments with p- 
marginals. Note that if Ws (c, r) = (1 + S)p{s)'^Tr{s)km, for some (5 > — 1, then the fraction of variables in Vs that are 
set to true in a and t is (1 + 5)p{s)'^. Consequently, the number of variables that are set to false in both assignments 
is {l-p{s))\Vs\ - ipis)\Vs\ - (1 + S)p{s)^\Vs\), and therefore 

w-,s{(J, r) = (1 — p{s))n{^s)km — (p{s)n{^s)km — (1 + S)p{s)'^n{^s)krrij 

In words, the overlap in s determines the overlap in ^s. However, note that the s'-overlap, for any s' ^ s, ^s, is not 
affected by the quantities u>s(cr, r) and w^s{o', t). 

Let i S T be a type. With the previous observation at hand we are able to estimate the number of pairs of 
p-satisfying assignments with a given t- and ^t-overlap. The proof can be found in Section [l0.2l 

Proposition 10.3 There is a c > such that the following is true. Let e, e' > 0. Let t e T be a type such that 
p{t) 7^ 1/2. Denote by "Hf ^((e, e') the set of pairs a, r of assignments with p-marginals, such that 

\wt{(T,T) — p{t)'^TT{t)km\ > ep{tYiT{t)km and |zi;^t(cr, r) — p(^t)^7r(^t)fcm| > e'p{^t)^n{-^t)km. 

Then, 

\n^{e, e')\ < \n\^ ■ exp {-on {e^T:{t) + e'^Tr{^t)) } . 

What remains is to enumerate pairs of p-satisfying assignments with a given ti/2-overlap. The next proposition 
provides this number as the coefficient of an appropriately defined generating function. 
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Proposition 10.4 Let e G (—1/4, 1/4). Let TL'^^^i^) denote the set of pairs cr', r' of assignments to the variables \ 



Vtj^2 such that 



and 

^t^iM) = W'ti/slo-') = 7r(ti/2fcm)/2. 
Thennl/^ie) = [(xy)'^(*i/2)W2 ^t(i/4+e)u(ti/2)fc™]^(2,^ ^t)^ ^/jg^g 

u) = n {{xyuf- + {xyuf- + x^^'y"-- + x^-y^^) . 

Proof. Assign to a pair of assignments ct', r' to the variables in Vt-^^^ the weight x™*!/^ ^'^ ^ 2/"*^^^''^ ''^ \ 

Then, by using ^ and (|94li 

^ a;'"*l/2('^') y'"*l/2(^') y°'l/2('^'^^') 

<T',r':Vtj/3^{0,l} 

= X! n M'yiv)=r{.v) = l]{xyu)'^^ + l[a(v)=T(v)=0]{xyu)'^-^ 

+ l[CT(i)) = l,r(t))=0]2:'*"y''"" + l[CT('u)=0,r(i;) = l]2;'^""y'*"- 

Summing this expression up yields the claimed statement. □ 

The next statement provides the asymptotic value of the sought coefficients of F{x, y, u) from the previous propo- 
sition. The proof can be found in Section [T0.3l 

Proposition 10.5 W.h.p. d chosen from D has the following property. There is a constant C = C{k, e) > such that 
if we write N = \Vt^^^ \ and M = T:[ti/2)km, then 

[[xy)^"^ y, u) ^ {C + o(l)) • E ■ N~^^^, 

where 

^ ^ ^-(l-4e)M/2 -Q (2 + 2^-^" (95) 



and p is the solution to the equation 

2 + 2p'^-+'i-- ■ 



V4 + ^)M= E (96) 



In order to complete the proof of Proposition 18 . 5 1 we will estimate the exponential term in the previous statement as a 
function of e. Note that if e = 0, then clearly p = 1 and E = 4^. Let |e| < 1/100. We begin with providing bounds 
for the value of p from Equation Let fg{p) = g/{2 + 2p3), where g > 3. Then fg{l) = g/A, /^(l) = -g^/8 
and 

^■2 fgp2a-2 _ gpg-2 ^ pg-2 ^ p2g-2\ 



~ ■ 

Note that if < p < 1, then, with room to spare, \ fg{p)\ < g'^. Moreover, if p > 1, then we may estimate as 
follows: 
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Let us write p = 1^-5. Taylor's theorem then impUes that \fg{p) — (g/4 — g^(5/8)| < g^5^. By writing g„ = d^, + d-,^ 
and recalUng that M — y^Ki^v 9v we infer from 



2 ;:2 



^^5*3 < eA/ < + 5^S:i, where 5, = ^ gl, for i e {2, 3}. 

In view of these inequahties we might expect that whenever e is not too large, then 5 sa —e8M/S2- This can be made 
precise as follows. By solving the quadratic equations explicitly we infer that S satisfies 



1 -S2 + y/Sl - 2 5653"£M ^ ^ ^ I -S2 + V^5fT2565^eM 



16 S'a - - 16 S'3 

Note that d is such that w.h.p. ^2 = Q{krM) and S'3 = 0((fcr)^Af ). Thus, for sufficiently large k 



.2 , .... o , 256S3eA/ _ ^ 128^3eAf ^ ^ , ^3 



S"^ + 256S'3£A/ = SaWl 



^1 



^2 



^1 



The square-root with the minus sign can be estimated analogously. We infer that 

8M 



p = 1 + S, where S 



S2 



(97) 



With the approximate value of p at hand we can proceed with estimating the exponential term in ( |95] |. First of, we 
rearrange terms to obtain 



The bounds on p imply that 



(98) 



veVt 



1/2 



8M 



j < exp|-16e2__ + C'(e3(fcr)-iM)| 



(99) 



Regarding the last term involving the product in ( |98] l, we bound it by the following probabilistic considerations. Note 
that 

Let (<5't,)t,g\/tj^, be a family of independent random variables, which are uniformly distributed in { — 1, +1}. Then the 
last expression in the previous display is equal to the expected value of p {—1/2 s^gy}. We obtain 



:= E 



<2^P 

t>o 



Note that since either p*/^ > 1 or p"*/^ > 1 we may assume without loss of generality that p > 1. The advantage 
of the above formulation is that we can estimate rather easily the probability for a large deviation of the sum S ~ 
Tliv ^v9v- Indeed, if we change the value of any Sy to obtain a new sum S', then \S — S'\ = 2g„. By applying 
Azuma-Hoeffding we obtain 



Svgv\ = t 



< 
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Thus, by using ( |97] i and noting that e < due to our assumption p > 1 we obtain the bound 

Since the exponent is convex in t, it can easily be seen that it is maximized at t = 4A/|£|, where its value equals 

(4M|£|)^ , ,^, 4Af(4Af|£|) 
775 ^ \^\ 5 = 

. Q^2 Aj2 _e^2 Aj2 

Thus,^ = 0{\/N)e ^ , and bv combining (|98]l and (|99]l we infer that E < VNe Butsince52 = Q{krM) 

and M = Q{krN), this is at most -v/lV e"'^'^ ^, for some c > 0. 

Proposition l8.5l then follows immediately from Propositions ! 10. 3lllQ. 51 and the (aforementioned) observation that 
the t- and i'-overlap of cr, x are independent for t t' , -^t. 



10.1 Proof of Proposition [1021 

Set M — 'K[ti/2)km. By the virtue of Cauchy's integral formula we obtain 

/ := [z^'"^]F{z) = — i F{z)z-^"^-^dz. 

Since F is analytic in C, C can be any curve enclosing the origin. To estimate the integral we will use the saddle point 
method, which is commonly used to determine the asymptotic behavior of integrals that involve a large parameter, 
and are simultaneously subject to huge variations. For an excellent overview and numerous applications we refer the 
reader to ifTSl . 

The main idea is to choose C such that the integrand 'peaks' at a unique point, so that the main contribution to the 
integral comes from a small neighborhood of this maximum. We choose C to be the unit circle centered at the origin, 
i.e., C = {e*^ : -TT < 6* < tt}. Moreover, let 6*0 = 6*0(71) = TV^^/s^ ^j.jtg ^ |g»0 . ^ < Ooin)} for the 
restriction of C to the segment with \9\ < OQ{n). Then we may write / = /q + /i, where 

lo^—i F{z)z-''/^-^dz and h ^ — i F{z)z-''^^-^dz. 
27r« Jco 27ri Jc\Co 

By changing variables, the first integral becomes 

1^^ -L f " H{e)de, where H{e) = e'"'"^'/^ ■ FT (e'S-^^ + e^^''-"). (100) 

^e^'1/2 

Moreover, by using the trivial bound for complex integrals and the fact \z\ — 1 on C we obtain 

/i<27r- sup \F{z)\ (101) 

zec\Co 

Our subsequent proof strategy is as follows. We will first compute the asymptotic value of the integral over the 'central 
region' ; in particular, we show that 

/o = (c + o(l))7V-i/22^ (102) 

for an appropriate c > 0. Then, by using (llOlb we show that Ii = o(/o). The two statements combined yield then 
immediately the conclusion of the proposition. 

We proceed with showing (1102b . Recall that \6\ < 9a = N^^^^, and note that for any d, d', by applying Taylor's 
Theorem 

e'Sd^^ted' =2 + i{d + d')e - ^{d^ + d'^)e^ + 0(^{l + i){d^ + d'^)9^^ uniformly for all d, d' e N,\9\ < Oq. 
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Let us write 

= 

4 



S2 = j dl + dl^ + {d, + d^yf and S,^ J2 (di+di,). for j > 3. 



Observe that d is w.h.p. such that Sj = (1 + o{l))cjN for some Cj = c_,(fc) > 0, where 2 < j < 9. Using dlOOl i we 
infer that the integrand satisfies 



Hie) = er^'^'l^ • n ( 2 + ^^d, + d^v)B - Udl + d\)B' + o((l + + rf3^^^3 



= 2^ exp {-520' + O ((1 + ^)(530^ + ^40* + • • • + ^g^^)) } 
= (1 + o(l))2^ cxp I - 5'26'2|, since 61 < iV-2/^ 

Thus, 

(27r)Jo = I ° H{e)dB = (1 + o(l)) 2^ / " e~^^^^d9 

= (l + o(l))^= / e-" = (l + o(l)). 



This proves (|102| l. To complete the proof we will show that sup^g,^^,^^ l-P"!^)! is asymptotically negligible com- 



pared to /q. First, for any w G Vj^ 



/2 



Let us collect some basic properties of /„. Note that if d„ = d^y, then fv{d) = 2 for any —tt < 9 < tt. Otherwise, / 
is maximized for any 

„ , . f . 2n , \dv- d^y\ 



\di, — d-,v\ 2 
where f{e) = 2. 

For a pair (c?+, d_) e let Vd^^d- Q ^H/^ denote the set of variables v such that d^ — d+ and d^v — d-, and 
write A^d+,d_ = Then, 

\Fie'')\= n /-'(^)= n (2 + 2cos((?(d+-d_)))^=/'. 

s=(d+,d„) 

Note that J2s={d+ d = N . Thus, |-F(e*^)| < 2^ for all 0. However, this bound is achieved only if all factors are 
maximized simultaneously. We will argue in the sequel that if 16*1 e ((?o, tt), then a linear (in N) fraction of the factors 
is < 2 - 0(iV"''/^). It follows for some a > that 

\F{e'')\ < 2(1-")^ • (2 - 0(7V-4/5))aJV ^ . g-o(Jvi/=) ^ o(7V-i/22^) = o(/o). 

To see the claim, consider the specific pair {d'^,d'_) = {kr, kr — 1), and note that if k is sufficiently large, then 
fcr — 1 > kr/2 + lOV A:2'^ In k. So, indeed Vd^.d- ^ Furthermore, d is such that w.h.p. there is a constant 

a = a{k) > such that Nd^.d^ > ctN. It follows that for all variables v £ Vd' ,d' 



U9) = v/2 + 2cos((?). 

It can easily be verified that fy is monotone increasing for ~tt < 9 < and decreasing for < 9 < tt. Thus, for any 
1^1 € {9o, tt) we have fv{9) < ma.x{fy{9o), ^o)}- By using the Taylor series expansion of the cosine and the 
square root we obtain that 

fy{e) = 2 - — + 0(9'^), uniformly for all - tt < 6* < tt. 
We conclude that /„ (9) < 2 — 0{n^'^^^) for at least aN variables v, and the proof is completed. 
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10.2 Proof of Proposition HO 

We will exploit a concentration inequality due to McDiarmid ll26ll . We present it here in a simplified form that is 
appropriate for our purpose. Given a finite non-empty set B, we denote by Sym{B) the set of all |i?|! permutations 
of the elements of B. Let Si, ... , Bn be a family of finite non-empty sets, and denote hy Vl — Sym{Bi) x • • • x 
Sym{Bisi)- Moreover, let tt = (tti, . . . , ttn) be a family of independent random permutations, where tt^ is drawn 
uniformly from Sym{Bi). 

Theorem 10.6 Let c and r be positive constants. Suppose that /i : O — > R+ is such that for any tt € the following 
conditions are satisfied. 

• Ifir' can be obtained from tt by swapping two elements, then \h{TT) — h{TT')\ < c. 

• Ifh{TT) > s, then there is a set of at most rs coordinates such that hij:') > s for any tt' G that agrees with tt 
on these coordinates. 

Let Z = h{'K) and let m be the median of Z. Then, for any t > 



Let us proceed with the proof of Proposition [To3] We will assume without loss of generality that t is such that 
p{t) > 1/2. We will abbreviate p = p{t), q = p{^t). Let a be an arbitrary assignment with p-marginals. Moreover, 
denote by t an assignment that is obtained by selecting for any signature s E t uniformly at random plKsj variables 
from Vs and setting them to true, and setting all other variables in V \ Vt arbitrarily so that r has p-marginals. 
Equivalently, we may generate t by permuting the variables in Vs randomly, and setting the first 1141 variables in that 
permutation to true, for all s G t. With this notation we obtain 



The latter probability can be estimated with Theorem ll0.6l Indeed, note that 

• if r, r' have p-marginals and can be obtained by swapping the truth assignment of two variables, then 



• if Wt {(T, t) > s, then there is a set 5 of < s/ mm^^Vt dv < 2s /kr variables that are set to true, and any r' with 
p-marginals that sets all variables is S to true satisfies Wt{(T, r') > s. 

We thus may apply Theorem |10.6| with c = Akr and r = 2/kr. Moreover, trivially E [wt{(J, t)] < n{t)km. We infer 
that 




\nl^t{e,e')\ < \n\^ ■ P [\wt{cr,T)-p^n{t)km\ > £7r(i)fcm] 



I (ct, r) ~ wt{cr,T')\ < 2 max < Akr. 




Exactly the same argument, where we interchange the roles of t and -^t, shows that also 



< 4exp -9(1) 



{e'TT{-^t)kmf 
kr ■ 7r(^t)fcm 



) 



4exp(-e(l)e^7r(^0n) . 



The claim follows. 
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10.3 Proof of Proposition HO 

Set M = TT{ti/2)km. By applying Cauchy's integral formula we obtain 

{2Tny Jc, Jc^ Jc, 

The function F is analytic in C"^, implying that Ci , C2, Co can be any curves enclosing the origin. We choose 

Ci = {pe^' : \e\ < tt}, = {pe^'^ : |^| < tt}, = {p-^e*'^ : |^| < tt}, 



where p is the solution to the Equation (|96] i. Some remarks are in place here. The choice of the integration paths 
may seem arbitrary at this point. Note, however, that F is symmetric with respect to x and y, and thus it is natural 
to assume similar integration curves for them. Moreover, the choice of p is guided by the general principles of the 
saddle-point method and is such that the integrand has a unique maximum at (0, (p, ijj) = (0, 0, 0). Indeed, as we will 
show subsequently, the integrand is around (0, 0, 0) of elliptic type; this allows us to reduce the estimation of the main 
terms to the evaluation of a 3-dimensional Gaussian integral. 

Denote by C the restriction of the circles Ci, C2, Co to a small region around the origin, i.e., 

C = {pe'" : \e\ < iV-2/5| X {pe"^ • |<^| < Ar-2/5} ^ {p-^e'^ : |V| < N-^/^}. 
Then we may write / = /q + /i , where 

and Ii is the integral over (Ci x C2 x Co) \ C. By changing variables we obtain 

Io = T^ I H(e, ip, ^)d^dipde, where H = p- "'2''""' e-»^^^±|^-»^(i/4+£)M TT hJQ^^^^y 

(103) 



_JV-2/5^JV-2/5]3 "^^'1/2 



and 

hy{0,(p,ll>) = g*(S+V+'A)'ii' _^ ^i{e+f+tp)d^^ _^ pd^+d^^^ied^+itpd^^ pd^,+d^^ ^iBd^^+iipd^ ^ 

Regarding Ii , we will use the trivial bound 

/i<(27rf sup \H{x,y,u)\ (104) 

(a;,y,«)e(CixC2XCo)\C 

to show that Ii = o(/o). 

We begin with estimating Iq by providing an appropriate asymptotic expansion of it for points around the origin. 
First of all, note that for any v £ Vt^,^ we have /i„(0, 0, 0) = 2 + 2p''"+''-" and thus 

Moreover, 

— /i„(0, 0, 0) = —h,{0, 0, 0) = (2 + 2p'*"+'^-") + d^,), and 777/^.(0, 0, 0) = i{d, + d^^). 
ou dip 2 oyj 

The second derivatives at (0, 0, 0) are given by 
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Furthermore, the mixed second derivatives are 

^K = -idl + d\, + 2d.d^y^'^^-) and ^h.,, = ^K = -[dl+dl^). 

We will also need crude bounds for the third-order derivatives in order to establish an accurate approximation for H 
around the origin. Note that hy linearly exponential in 0, if, ip and d^, d-,v Thus, every time we take a derivative 
with respect to some variable, the norm of each single term in the expression of hy can increase by at most m„ = 
max{d,„,d^.J. Thus, uniformly for (61,^, ?A) G [-A^^/s^ ^2/5] 

we have that 



^3 

-h, 



<2(i + p<i.+d..)(rf^+rf^^)3^ where a, 6, 6 S {0, y^, V}- 



By using the uniform estimate 1 + x ~ QX~x^/2+e(x^) ^ where we set 1 + a; = hy{9, ip, ip) //i„(0, 0, 0) we infer that 

t (0,0,0) = + ^-^^^^ + '^^ + ' 2 + V?+^^> + 2"^^ + ^^^^^ 
where the 2nd order terms are 



8 V ^ 2(2 + 2p'^"+'^-)2 ^ 2(2 + 2p'i" 4_^4^d„+d^„v 

Finally, since [9, (p, G [— A^^/s^ A^^/sj jjjg e]-]-or term is of order at most [d^ + d^y)^N^^^^. In order to obtain an 
approximation for H we form the product over all v E Vt-^^^- Observe that the (linear in the variables) exponential 
factor e-''^i^+v)M/2-iip{i/4+E)M Q^Y\ce\s exactly with the first order terms in (1105b . By abbreviating 

_ (d, - _ (d. -d^,)2 + 2/"+'^-^(dg+d2j 



2(2 + 2pd 



and 



we obtain uniformly for any (0, ip, ip) G [— iV^^/s^ ^-2/5j3 

In (^^^ = -^e,e(^' + f^) - ^^.^V'' + Se,^e<l> - Sg,^{e + (^)V' + OiS^N-'^/''). 
Observe that d is such that w.h.p. all quantities S,,, and 5*3 are linear in N. Thus, we are left with computing 

/O (1 + o{l))E ■ [ f,-SeAe^+V^)-Si.,i,i'^+Se.4.Ov-Se,^{e+v)i,^^^^^0_ 

J[_Ar~2/5 7V-2/5]3 

In order to compute this integral we rescale each variable with N~^^^. By writing s... for S,_,/N we obtain 

/o = (1 + o{l))E ■ A^~3/2 . [ (.-'>e.e{0^+v'')-si,.^i>^+se,^ev-se.^{e+v)i,^^^^j^Q^ 

J[-iVi/io^Ari/iO]3 

A termwise comparison and elementary algebraic manipulations yield that 

^^9,0 ~ ^0,ip — and 2S^^^Sg^g — Sg .^ — S.^^^Sg^^ > 
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Thus, the squares can be completed and the integral in the above expression equals a constant depending on the family 
s.,.; this shows that asymptotically Ii is proportional to A^^3/2 . ^ 

In order to complete the proof we will use (|104| i to show that /i is asymptotically negligible compared to Iq. Recall 
the definition of H from (11031 ). It follows that the absolute value of H is given by 

^-(i-4.)M/2. -Q where M9,^,ij) = \h4e,ip,i;)\. 

Let us abbreviate Dy — dy — d^^. A lengthy calculation, which can be performed easily with the help of MAPLE, 
yields that 

fy{e, ^, ^ f^2 + 2p2K+<i^.) + 2cos {D,{e + ip + i!))+ 2p2(<i.+<i^.) cos - ^)) 

+ 2p''"+'*-" (cos (A.<^ + d^i^) + cos {D^e + d.^i) + cos [0.^9 - d-.^i/j) + cos {Dyip - d^.^V')) • 

Note that we can get an upper bound for if we replace all terms involving a cosine by one; this implies that 
1^1 < p^(i-4e)A//2 J^^(2 + 2p'^^~^'^^^) = E. Moreover, the bound is achieved only if all factors are maximized 
simultaneously, and this happens for example when we choose (6*, if, ip) = (0, 0, 0). We will argue in the sequel 
that if {9,ip,ip) G (C*! X ^2 X Co) \ C, i.e., at least one of the variables 9,ip,^ is assigned a value not lying in 
[-7V-2/5^ 7V~2/5]^ then there is a subset of variables V' C Vt^^^ such that \V'\ > aN for some a > and for all 
V e V it holds /.„(0, 0, 0) < /^,(0, 0, 0) - 0{N-^/^). Indeed, if this is true, then 

\H\ < p-a-4s)A//2 -Q (2 + 2p'^"+'*-) n (2 + 2p'*"+'*- - 0(7V-4/5)). 

Since p is bounded and d is such that w.h.p. d^ + d~,y — o(log n), it follows that \H\ smaller that E by an exponential 
factor, which shows with ( |104t that Ii = o{Io). 

To see that a set V' with the desired properties exists, let us assume that at least one of 9, ip, ip is in absolute value 
at least N^"^/^. For a pair [d^, d^) e let Vd^.d^ ^ denote the set of variables v such that d,, = d+ and 
d-,y — and write Nd^.d^ = \Vd^.d- \- Consider the specific pair {d^,d^) = (kr, kr — 1), and note that for 
all such variables we have Dy = 1. Furthermore, d is such that w.h.p. there is a constant P — P{k) > such that 
Nd+,d- > I3N. Then we may assume that 

for all V e Nd^.d- ■ .W, ^p, V') > (2 + 2p2fc'-i - 0{N-^'^)), 

as otherwise there is nothing to show. This impliesthat the arguments of all cosines appearing in the expression of /„ 
are close to multiples of 27r, and in particular, 

\9 + (t> + i'\, \9~ipl |(y9 + d+V'l = 0(7V-2/5) (mod27r); (106) 

this follows directly from the series expansion of the cosine around integer multiples of 2tt, which lack a linear term. 
Next, consider the pair {d'j^, d'J) = (fcr, kr — 2); again d is such that w.h.p. there is a constant /?' = P'{k) > such 
that Nd'^^d' > P' N . Note that for these variables we have Dy ~ 2. Then, as previously, we may also assume that 

for all veNd'^,d'_ : fv{9,^,ij) > (2 + 2p2fe--2 _ o(iv-4/5)), 

But then, by the same argument as above, \2(p + cJ^V'l = 0{N^^^^) (mod 2n). Since d-^ = d'j_ and, by assumption, 
\ip\ < TT, by combining this with the third term in ( |106l l, we infer that \tp\ — 0{N^^^^). In turn, together with the 
second term in ( 1106b . this impUes that also \9\ = 0{N^'^/^). Finally, the fact \9 + + '4)\ = 0(iV~2/5) (mod 27r) 
from ( 11061 ) then also implies that \5\ = 0{N~'^/^). Everything together yields that (6*, ip, t/i) e C, a contradiction. 
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11 Proof of Corollary E 



As a direct consequence of our second moment argument, the Paley-Zygmund inequality, and a concentration result 
on the number of satisfying assignments from HI we obtain the following. 

Proposition 11.1 For r as in (fl4] i we have > E • exp [--jiffc] w.h.p. 

We consider the following "planted model": let A = A/j(?i, m) be the the of all pairs (<!>, a) of /c-CNFs $ over 
V with m clauses and satisfying assignments a E Let Pa signify the uniform distribution over A; Pa is 

sometimes called the planted model. Moreover, let Pg be the distribution on A obtained by first choosing a random 
formula ^ and then a uniformly random cr G S{^) (provided that ^ is satisfiable); Pq is sometimes called the Gibbs 
distribution. Combining Proposition 11 1 . 11 with an argument from [], we obtain the following "transfer result". 

Corollary 11.2 For any B C A the following is true. If Pa [B] < exp [--JtIf], then Pq [B] = o(l). 



Thus, in order to show that some 'bad' event B is unhkely under Pg, we "just" need to show that Pa [B] < 
exp [— -p^] is exponentially small. 



Lemma 11.3 There is a number d 



6{k) > such that 
dist{a, a,naj) > ^ ^ ^ 



< 



exp 



2nr 



Proof. We can generate a pair ($, a) from the planted model as follows: first, choose a e {0, 1} uniformly; then, 
generate m clauses that are satisfied under tr uniformly and independently. Without loss of generality, we may assume 
that cr = 1 is the all-true assignment. We need to study the distribution d ~ {di )i£L of literal degrees. To this end, let 
(e/)ig L be a family of independent Poisson variables such that E [e/] = E [di] for all /. It is easily verified that there is 
C ^ 9(2-'^) such that 

E[d.,]^-{1 + C), E[d.,] = -(1-C) (107) 



for all X G V. Furthermore, if we let £ be the event that X];eL 
distribution as d. Moreover, 

P [£] = 17(71-1/2). 



km, then e = {ei)i^L given £ has the same 

(108) 



Let 



Y = 



1 V- 
n ^-^ 



1 



Viewing the difference e^, — e^^. as a random walk of length Po(fcr) and using limit theorems for resulting distribution 
(the Skellam distribution), we obtain from (11 07b that E [F] > i + Q(\/kr/2''). Further, applying Chernoff bounds to 
Y (which is a sum of independent contributions), we find that for a certain S = fl{^/kr /2'') 









p 


Y<l + 5 


< cxp 



f7(V/cr/2 ) n 



< cxp 



3nr 



(109) 



Finally, the assertion follows from dlOSt and ( |109t . 



□ 



12 Proof of Lemma 2.3 



The expected majority weight in $ is easily computed. In for each x the numbers d^, d^x of positive/negative 
occurrences are asymptotically independently Poisson with mean kr/2. Therefore, for any d = Q{kr) we obtain 



E[|d, 



d] = y^2d/n + Okil). 
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In effect, 



E K„a,(*)] ~ 77 + \/ ^ + Okil/kr). 



(110) 



By comparison, given that, say, the all-true assignment is satisfying, the number of positive occurrences has 
distribution Po((l+l/(2'^— l))fcr/2), while d^x has distribution Po((l— 1/(2'^— l))fcr/2). The normal approximation 
to the Poisson distribution yields for d ~ Q{kr), 



E [\dx - 1 1 e dx + d^, = d] = y^2d/^ + e(4-^-d3/2) + Ok{l). 

for a certain constant c > 0. Consequently, 



E K„,,(*) I 1 G Sm + + m-'ikrm 



(111) 



Both with and without conditioning on 1 G w„ioj enjoys the following Lipschitz property: changing one 

single clause can alter the value of Wmaj by at most k/{km) = l/{rn). Therefore, Azuma's inequality yields 



E [w„ 



> A] < 2exp 



P[|w;„,j-E[u;™,,]| > A|le5(*)] < 2cxp 
In effect, for a certain constant ^ > we have 



{rXnf 
2m 





rX^n 


— 2 cxp 


2 



< cxp [-n (fc/4'=) n] 

< cxp[-n{k/4'')n] 



(112) 
(113) 



Combining (II 12l l and ( 11131 ) with a simple counting argument yields Lemma 2 from the extended abstract. 
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